Snowflake Data Engineering
暫譯: Snowflake 數據工程
Ferle, Maja
- 出版商: Manning
- 出版日期: 2025-01-28
- 售價: $2,320
- 貴賓價: 9.5 折 $2,204
- 語言: 英文
- 頁數: 368
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1633436853
- ISBN-13: 9781633436855
海外代購書籍(需單獨結帳)
相關主題
商品描述
A practical introduction to data engineering on the powerful Snowflake cloud data platform. Data engineers create the pipelines that ingest raw data, transform it, and funnel it to the analysts and professionals who need it. The Snowflake cloud data platform provides a suite of productivity-focused tools and features that simplify building and maintaining data pipelines. In Snowflake Data Engineering, Snowflake Data Superhero Maja Ferle shows you how to get started. In Snowflake Data Engineering you will learn how to: - Ingest data into Snowflake from both cloud and local file systems
- Transform data using functions, stored procedures, and SQL
- Orchestrate data pipelines with streams and tasks, and monitor their execution
- Use Snowpark to run Python code in your pipelines
- Deploy Snowflake objects and code using continuous integration principles
- Optimize performance and costs when ingesting data into Snowflake Snowflake Data Engineering reveals how Snowflake makes it easy to work with unstructured data, set up continuous ingestion with Snowpipe, and keep your data safe and secure with best-in-class data governance features. Along the way, you'll practice the most important data engineering tasks as you work through relevant hands-on examples. Throughout, author Maja Ferle shares design tips drawn from her years of experience to ensure your pipeline follows the best practices of software engineering, security, and data governance. Foreword by Joe Reis. Purchase of the print book includes a free eBook in PDF and ePub formats from Manning Publications. About the technology Pipelines that ingest and transform raw data are the lifeblood of business analytics, and data engineers rely on Snowflake to help them deliver those pipelines efficiently. Snowflake is a full-service cloud-based platform that handles everything from near-infinite storage, fast elastic compute services, inbuilt AI/ML capabilities like vector search, text-to-SQL, code generation, and more. This book gives you what you need to create effective data pipelines on the Snowflake platform. About the book Snowflake Data Engineering guides you skill-by-skill through accomplishing on-the-job data engineering tasks using Snowflake. You'll start by building your first simple pipeline and then expand it by adding increasingly powerful features, including data governance and security, adding CI/CD into your pipelines, and even augmenting data with generative AI. You'll be amazed how far you can go in just a few short chapters! What's inside - Ingest data from the cloud, APIs, or Snowflake Marketplace
- Orchestrate data pipelines with streams and tasks
- Optimize performance and cost About the reader For software developers and data analysts. Readers should know the basics of SQL and the Cloud. About the author Maja Ferle is a Snowflake Subject Matter Expert and a Snowflake Data Superhero who holds the SnowPro Advanced Data Engineer and the SnowPro Advanced Data Analyst certifications. Table of Contents Part 1
1 Data engineering with Snowflake
2 Creating your first data pipeline
Part 2
3 Best practices for data staging
4 Transforming data
5 Continuous data ingestion
6 Executing code natively with Snowpark
7 Augmenting data with outputs from large language models
8 Optimizing query performance
9 Controlling costs
10 Data governance and access control
Part 3
11 Designing data pipelines
12 Ingesting data incrementally
13 Orchestrating data pipelines
14 Testing for data integrity and completeness
15 Data pipeline continuous integration
- Transform data using functions, stored procedures, and SQL
- Orchestrate data pipelines with streams and tasks, and monitor their execution
- Use Snowpark to run Python code in your pipelines
- Deploy Snowflake objects and code using continuous integration principles
- Optimize performance and costs when ingesting data into Snowflake Snowflake Data Engineering reveals how Snowflake makes it easy to work with unstructured data, set up continuous ingestion with Snowpipe, and keep your data safe and secure with best-in-class data governance features. Along the way, you'll practice the most important data engineering tasks as you work through relevant hands-on examples. Throughout, author Maja Ferle shares design tips drawn from her years of experience to ensure your pipeline follows the best practices of software engineering, security, and data governance. Foreword by Joe Reis. Purchase of the print book includes a free eBook in PDF and ePub formats from Manning Publications. About the technology Pipelines that ingest and transform raw data are the lifeblood of business analytics, and data engineers rely on Snowflake to help them deliver those pipelines efficiently. Snowflake is a full-service cloud-based platform that handles everything from near-infinite storage, fast elastic compute services, inbuilt AI/ML capabilities like vector search, text-to-SQL, code generation, and more. This book gives you what you need to create effective data pipelines on the Snowflake platform. About the book Snowflake Data Engineering guides you skill-by-skill through accomplishing on-the-job data engineering tasks using Snowflake. You'll start by building your first simple pipeline and then expand it by adding increasingly powerful features, including data governance and security, adding CI/CD into your pipelines, and even augmenting data with generative AI. You'll be amazed how far you can go in just a few short chapters! What's inside - Ingest data from the cloud, APIs, or Snowflake Marketplace
- Orchestrate data pipelines with streams and tasks
- Optimize performance and cost About the reader For software developers and data analysts. Readers should know the basics of SQL and the Cloud. About the author Maja Ferle is a Snowflake Subject Matter Expert and a Snowflake Data Superhero who holds the SnowPro Advanced Data Engineer and the SnowPro Advanced Data Analyst certifications. Table of Contents Part 1
1 Data engineering with Snowflake
2 Creating your first data pipeline
Part 2
3 Best practices for data staging
4 Transforming data
5 Continuous data ingestion
6 Executing code natively with Snowpark
7 Augmenting data with outputs from large language models
8 Optimizing query performance
9 Controlling costs
10 Data governance and access control
Part 3
11 Designing data pipelines
12 Ingesting data incrementally
13 Orchestrating data pipelines
14 Testing for data integrity and completeness
15 Data pipeline continuous integration
商品描述(中文翻譯)
實用的 Snowflake 雲端數據平台數據工程入門。
數據工程師創建管道,將原始數據攝取、轉換,並引導至需要這些數據的分析師和專業人士。Snowflake 雲端數據平台提供了一套以生產力為重點的工具和功能,簡化了數據管道的構建和維護。在 Snowflake Data Engineering 中,Snowflake 數據超級英雄 Maja Ferle 將向您展示如何入門。 在 Snowflake Data Engineering 中,您將學習如何: - 從雲端和本地檔案系統將數據攝取到 Snowflake- 使用函數、儲存過程和 SQL 轉換數據
- 使用流和任務協調數據管道,並監控其執行
- 使用 Snowpark 在您的管道中運行 Python 代碼
- 使用持續集成原則部署 Snowflake 對象和代碼
- 在將數據攝取到 Snowflake 時優化性能和成本 Snowflake Data Engineering 揭示了 Snowflake 如何輕鬆處理非結構化數據,使用 Snowpipe 設置持續攝取,並通過一流的數據治理功能確保您的數據安全。在此過程中,您將通過相關的實作範例練習最重要的數據工程任務。整本書中,作者 Maja Ferle 分享了她多年經驗中提煉的設計建議,以確保您的管道遵循軟體工程、安全性和數據治理的最佳實踐。 前言由 Joe Reis 撰寫。 購買印刷版書籍可獲得 Manning Publications 提供的免費 PDF 和 ePub 格式電子書。 關於技術 攝取和轉換原始數據的管道是商業分析的命脈,數據工程師依賴 Snowflake 來高效交付這些管道。Snowflake 是一個全方位的雲端平台,處理從近乎無限的儲存、快速的彈性計算服務、內建的 AI/ML 功能(如向量搜索、文本轉 SQL、代碼生成等)的一切。本書提供了您在 Snowflake 平台上創建有效數據管道所需的所有資訊。 關於本書 Snowflake Data Engineering 指導您逐步完成使用 Snowflake 的工作任務數據工程。您將從構建第一個簡單的管道開始,然後通過添加越來越強大的功能來擴展它,包括數據治理和安全性、將 CI/CD 整合到您的管道中,甚至使用生成式 AI 增強數據。您會驚訝於在短短幾個章節中您能走多遠! 內容概覽 - 從雲端、API 或 Snowflake Marketplace 攝取數據
- 使用流和任務協調數據管道
- 優化性能和成本 讀者對象 針對軟體開發人員和數據分析師。讀者應該了解 SQL 和雲端的基本知識。 關於作者 Maja Ferle 是 Snowflake 主題專家和 Snowflake 數據超級英雄,擁有 SnowPro 高級數據工程師和 SnowPro 高級數據分析師認證。 目錄 第一部分
1 使用 Snowflake 的數據工程
2 創建您的第一個數據管道
第二部分
3 數據暫存的最佳實踐
4 轉換數據
5 持續數據攝取
6 使用 Snowpark 原生執行代碼
7 使用大型語言模型的輸出增強數據
8 優化查詢性能
9 控制成本
10 數據治理和訪問控制
第三部分
11 設計數據管道
12 逐步攝取數據
13 協調數據管道
14 測試數據完整性和完整性
15 數據管道持續集成
作者簡介
Maja Ferle is a seasoned data architect with more than 30 years of experience in data analytics, data warehousing, business intelligence, data engineering, data modeling, and database administration. She holds the SnowPro Advanced Data Engineer and the SnowPro Advanced Data Analyst certifications. She is also a Snowflake Subject Matter Expert and a Snowflake Data Superhero.
作者簡介(中文翻譯)
Maja Ferle 是一位資深的數據架構師,擁有超過 30 年的數據分析、數據倉儲、商業智慧、數據工程、數據建模和資料庫管理經驗。她持有 SnowPro 進階數據工程師和 SnowPro 進階數據分析師的認證。她也是 Snowflake 的主題專家和 Snowflake 數據超級英雄。