Data Engineering with dbt: A practical guide to building a cloud-based, pragmatic, and dependable data platform with SQL

Zagni, Roberto

  • Publisher: Packt Publishing
  • Publication date: 2023-06-30
  • List price: $2,460
  • VIP price: $2,337 (5% off)
  • Language: English
  • Pages: 578
  • Binding: Quality Paper - also called trade paper
  • ISBN: 1803246286
  • ISBN-13: 9781803246284
  • Related categories: SQL
  • Overseas special-order title (requires separate checkout)

Product description

Use easy-to-apply patterns in SQL and Python to adopt modern analytics engineering and build agile platforms with dbt that are well tested and simple to extend and run. Purchase of the print or Kindle book includes a free PDF eBook.


Key Features:

  • Build a solid dbt base and learn data modeling and the modern data stack to become an analytics engineer
  • Build automated and reliable pipelines to deploy, test, run, and monitor ELTs with dbt Cloud
  • Guided dbt + Snowflake project to build a pattern-based architecture that delivers reliable datasets


Book Description:

dbt Cloud helps professional analytics engineers automate the application of powerful and proven patterns to transform data from ingestion to delivery, enabling real DataOps.

This book begins by introducing you to dbt and its role in the data stack, along with how it uses simple SQL to build your data platform, helping you and your team work better together. You'll find out how to leverage data modeling, data quality, master data management, and more to build a simple-to-understand and future-proof solution. As you advance, you'll explore the modern data stack, understand how data-related careers are changing, and see how dbt enables this transition into the emerging role of an analytics engineer. The chapters help you build a sample project using the free version of dbt Cloud, Snowflake, and GitHub to create a professional DevOps setup with continuous integration, automated deployment, ELT run, scheduling, and monitoring, solving practical cases you encounter in your daily work.

By the end of this dbt book, you'll be able to build an end-to-end pragmatic data platform by ingesting data exported from your source systems, coding the needed transformations, including master data and the desired business rules, and building well-formed dimensional models or wide tables that'll enable you to build reports with the BI tool of your choice.


What You Will Learn:

  • Create a dbt Cloud account and understand the ELT workflow
  • Combine Snowflake and dbt for building modern data engineering pipelines
  • Use SQL to transform raw data into usable data, and test its accuracy
  • Write dbt macros and use Jinja to apply software engineering principles
  • Test data and transformations to ensure reliability and data quality
  • Build a lightweight pragmatic data platform using proven patterns
  • Write easy-to-maintain idempotent code using dbt materialization


Who this book is for:

This book is for data engineers, analytics engineers, BI professionals, and data analysts who want to learn how to build simple, future-proof, and maintainable data platforms in an agile way. Project managers, data team managers, and decision makers looking to understand the importance of building a data platform and foster a culture of high-performing data teams will also find this book useful. Basic knowledge of SQL and data modeling will help you get the most out of the many layers of this book. The book also includes primers on many data-related subjects to help juniors get started.
