Data Engineering with Databricks Cookbook: Build effective data and AI solutions using Apache Spark, Databricks, and Delta Lake
暫譯: Databricks 食譜：使用 Apache Spark、Databricks 和 Delta Lake 建立有效的數據與 AI 解決方案

Name: Data Engineering with Databricks Cookbook: Build effective data and AI solutions using Apache Spark, Databricks, and Delta Lake
Price: 1824 TWD
Availability: InStock
Author: Chadha, Pulkit
ISBN: 1837633355

Chadha, Pulkit

出版商: Packt Publishing
出版日期: 2024-05-31
售價: $1,920
貴賓價: 9.5 折 $1,824
語言: 英文
頁數: 438
裝訂: Quality Paper - also called trade paper
ISBN: 1837633355
ISBN-13: 9781837633357
相關分類: Spark

立即出貨 (庫存=1)

買這商品的人也買了...

~~$790~~ $774

基礎資料結構 ─ 使用 C++ (Fundamentals of Data Structures in C++, 2/e)
~~$650~~ $514

資料庫理論與實務 Access 2007
~~$940~~ $700

無瑕的程式碼－敏捷軟體開發技巧守則 + 番外篇－專業程式設計師的生存之道 (雙書合購)
~~$780~~ $616

精通 Python｜運用簡單的套件進行現代運算 (Introducing Python: Modern Computing in Simple Packages)
$250

OpenCV 3 計算機視覺 : Python 語言實現, 2/e (Learning OpenCV 3 Computer Vision with Python, 2/e)
~~$550~~ $435

學會 Python - 從不懂，到玩上手！
~~$790~~ $672

無瑕的程式碼－敏捷完整篇－物件導向原則、設計模式與 C# 實踐 (Agile principles, patterns, and practices in C#)
~~$590~~ $460

TensorFlow + Keras 深度學習人工智慧實務應用
~~$520~~ $411

Soft Skills 軟實力｜軟體開發人員的生存手冊 (Soft Skills: The software developer's life manual)
~~$580~~ $452

無瑕的程式碼－整潔的軟體設計與架構篇 (Clean Architecture: A Craftsman's Guide to Software Structure and Design)
~~$210~~ $200

人工智能基礎 (高中版)
~~$680~~ $578

領域驅動設計：軟體核心複雜度的解決方法 (Domain-Driven Design: Tackling Complexity in the Heart of Software)
~~$69~~ $60

I'm From Taiwan / Programmer 阿喵宅造型貼紙7X7公分 (粉色)
~~$880~~ $695

深入淺出 Go (Head First Go)
~~$68~~ $68

阿喵宅開發順利春聯 2入
~~$520~~ $468

白話演算法！培養程式設計的邏輯思考 (Grokking Algorithms: An illustrated guide for programmers and other curious people)
~~$880~~ $695

比 Docker 再高階一步：使用 Harbor 完成 Helm Chart 容器及鏡像雲端原生管理
~~$880~~ $695

超圖解 ESP32 深度實作
~~$599~~ $473

資料科學的建模基礎 : 別急著 coding！你知道模型的陷阱嗎？
~~$400~~ $360

人工智慧應用在我家 - 使用 KNERON AI Dongle(耐能AI加速棒) - 附 MOSME 行動學習一點通：診斷．評量．影音．擴增．加值
~~$520~~ $442

Final Cut Pro 職人剪片全攻略：一台 Mac 包辦影音剪輯、素材處理、調色技巧，打造流暢的高質感影片！
~~$768~~ $730

RHCSA / RHCE8 紅帽 Linux 認證學習教程
~~$450~~ $351

超實用！會計．生管．財務的辦公室 EXCEL 必備 50招省時技 (2016/2019/2021) (暢銷回饋版)
~~$599~~ $569

Final Cut Pro 視頻後期剪輯零基礎入門到精通
~~$888~~ $844

C++ 高性能編程

商品描述

Work through 70 recipes for implementing reliable data pipelines with Apache Spark, optimally store and process structured and unstructured data in Delta Lake, and use Databricks to orchestrate and govern your data

Key Features

Learn data ingestion, data transformation, and data management techniques using Apache Spark and Delta Lake
Gain practical guidance on using Delta Lake tables and orchestrating data pipelines
Implement reliable DataOps and DevOps practices, and enforce data governance policies on Databricks
Purchase of the print or Kindle book includes a free PDF eBook

Book Description

Data Engineering with Databricks Cookbook will guide you through recipes to effectively use Apache Spark, Delta Lake, and Databricks for data engineering, beginning with an introduction to data ingestion and loading with Apache Spark.

As you progress, you'll be introduced to various data manipulation and data transformation solutions that can be applied to data. You'll find out how to manage and optimize Delta tables, as well as how to ingest and process streaming data. The book will also show you how to improve the performance problems of Apache Spark apps and Delta Lake. Later chapters will show you how to use Databricks to implement DataOps and DevOps practices and teach you how to orchestrate and schedule data pipelines using Databricks Workflows. Finally, you'll understand how to set up and configure Unity Catalog for data governance.

By the end of this book, you'll be well-versed in building reliable and scalable data pipelines using modern data engineering technologies.

What you will learn

Perform data loading, ingestion, and processing with Apache Spark
Discover data transformation techniques and custom user-defined functions (UDFs) in Apache Spark
Manage and optimize Delta tables with Apache Spark and Delta Lake APIs
Use Spark Structured Streaming for real-time data processing
Optimize Apache Spark application and Delta table query performance
Implement DataOps and DevOps practices on Databricks
Orchestrate data pipelines with Delta Live Tables and Databricks Workflows
Implement data governance policies with Unity Catalog

Who this book is for

This book is for data engineers, data scientists, and data practitioners who want to learn how to build efficient and scalable data pipelines using Apache Spark, Delta Lake, and Databricks. To get the most out of this book, you should have basic knowledge of data architecture, SQL, and Python programming.

商品描述(中文翻譯)

透過 70 個食譜學習如何使用 Apache Spark 實現可靠的數據管道，最佳化存儲和處理 Delta Lake 中的結構化和非結構化數據，並使用 Databricks 來協調和管理您的數據

主要特點

學習使用 Apache Spark 和 Delta Lake 的數據攝取、數據轉換和數據管理技術

獲得有關使用 Delta Lake 表和協調數據管道的實用指導

實施可靠的 DataOps 和 DevOps 實踐，並在 Databricks 上強制執行數據治理政策

購買印刷版或 Kindle 書籍包括免費 PDF 電子書

書籍描述

《Databricks 食譜中的數據工程》將指導您通過食譜有效使用 Apache Spark、Delta Lake 和 Databricks 進行數據工程，首先介紹使用 Apache Spark 進行數據攝取和加載。

隨著進展，您將接觸到各種可以應用於數據的數據操作和數據轉換解決方案。您將了解如何管理和優化 Delta 表，以及如何攝取和處理流數據。本書還將展示如何改善 Apache Spark 應用程序和 Delta Lake 的性能問題。後面的章節將告訴您如何使用 Databricks 實施 DataOps 和 DevOps 實踐，並教您如何使用 Databricks Workflows 協調和排程數據管道。最後，您將了解如何設置和配置 Unity Catalog 以進行數據治理。

在本書結束時，您將熟練掌握使用現代數據工程技術構建可靠且可擴展的數據管道。

您將學到的內容