Data Engineering with Google Cloud Platform - Second Edition: A guide to leveling up as a data engineer by building a scalable data platform with Goog
Wijaya, Adi
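- Publisher: Packt Publishing
- Publication date: 2024-04-30
- Price: $1,760
- VIP price: 5% off, $1,672
- Language: English
- Pages: 476
- Binding: Quality Paper - also called trade paper
- ISBN: 1835080111
- ISBN-13: 9781835080115
- Related categories: Google Cloud, JVM languages
Overseas special-order book (requires separate checkout)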
Product Description
Become a successful data engineer by building and deploying your own data pipelines on Google Cloud, including making key architectural decisions.
Key Features
- Get up to speed with data governance on Google Cloud
- Learn how to use various Google Cloud products like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream
- Boost your confidence by getting Google Cloud data engineering certification guidance from real exam experiences
- Purchase of the print or Kindle book includes a free PDF eBook
The second edition of Data Engineering with Google Cloud builds on the first edition, offering greater clarity and depth to data professionals working in data engineering. Beyond its foundational lessons, this new edition covers data governance within Google Cloud, providing you with insights into managing and optimizing data resources effectively. The book also guides you through recent developments in the Google Cloud ecosystem, from exploring Cloud Composer 2 to the evolution of Airflow 2.5. Additionally, you'll explore how to work with tools like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream to perform data governance on datasets. By the end of this book, you'll be equipped to practice data engineering on Google Cloud, from foundational principles to current practices.
What you will learn
- Load data into BigQuery and materialize its output
- Focus on data pipeline orchestration using Cloud Composer
- Formulate Airflow jobs to orchestrate and automate a data warehouse
- Establish a Hadoop data lake, generate ephemeral clusters, and execute jobs on the Dataproc cluster
- Harness Pub/Sub for messaging and ingestion for event-driven systems
- Apply Dataflow to conduct ETL on streaming data
- Implement data governance services on Google Cloud
Data analysts, IT practitioners, software engineers, or any data enthusiasts looking to have a successful data engineering career will find this book invaluable. Additionally, experienced data professionals who want to start using Google Cloud to build data platforms will get clear insights on how to navigate the path. Whether you're a beginner who wants to explore the fundamentals or a seasoned professional seeking to learn the latest data engineering concepts, this book is for you.
Table of Contents
- Fundamentals of Data Engineering with GCP
- Big Data Capabilities on GCP
- Building a Data Warehouse in BigQuery
- Build Orchestration for Batch Data Loading Using Cloud Composer
- Building a Data Lake Using Dataproc
- Process Streaming Data with Datastream, Pub/Sub and Dataflow
- Visualizing Data for Making Data-Driven Decisions with Looker Studio
- Build Machine Learning Solutions on GCP
- User and Project Management on GCP
- Data Governance in GCP
- Cost Strategy in GCP
- CI/CD on Google Cloud Platform for Data Engineers
- Boost your confidence as a Data Engineer