Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale (Paperback)

Thalpati, Gaurav Ashok

  • 出版商: O'Reilly
  • 出版日期: 2024-08-27
  • 售價: $2,450
  • 貴賓價: 9.5$2,328
  • 語言: 英文
  • 頁數: 283
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1098153014
  • ISBN-13: 9781098153014
  • 立即出貨 (庫存 < 3)

買這商品的人也買了...

商品描述

This concise yet comprehensive guide explains how to adopt a data lakehouse architecture to implement modern data platforms. It reviews the design considerations, challenges, and best practices for implementing a lakehouse and provides key insights into the ways that using a lakehouse can impact your data platform, from managing structured and unstructured data and supporting BI and AI/ML use cases to enabling more rigorous data governance and security measures.

Practical Lakehouse Architecture shows you how to:

  • Understand key lakehouse concepts and features like transaction support, time travel, and schema evolution
  • Understand the differences between traditional and lakehouse data architectures
  • Differentiate between various file formats and table formats
  • Design lakehouse architecture layers for storage, compute, metadata management, and data consumption
  • Implement data governance and data security within the platform
  • Evaluate technologies and decide on the best technology stack to implement the lakehouse for your use case
  • Make critical design decisions and address practical challenges to build a future-ready data platform
  • Start your lakehouse implementation journey and migrate data from existing systems to the lakehouse

商品描述(中文翻譯)

這本簡明而全面的指南解釋了如何採用數據湖屋架構來實現現代數據平台。它回顧了設計考量、挑戰和實施湖屋的最佳實踐,並提供了關鍵見解,說明使用湖屋如何影響您的數據平台,從管理結構化和非結構化數據、支持商業智慧(BI)和人工智慧/機器學習(AI/ML)用例,到促進更嚴格的數據治理和安全措施。

《實用湖屋架構》將教您如何:
- 理解湖屋的關鍵概念和特徵,如交易支持、時間旅行和模式演變
- 理解傳統數據架構與湖屋數據架構之間的差異
- 區分各種文件格式和表格格式
- 設計湖屋架構層以進行存儲、計算、元數據管理和數據消費
- 在平台內實施數據治理和數據安全
- 評估技術並決定最佳技術堆疊,以實現適合您用例的湖屋
- 做出關鍵設計決策並解決實際挑戰,以建立未來準備好的數據平台
- 開始您的湖屋實施之旅,並將數據從現有系統遷移到湖屋