Business Intelligence with Databricks SQL: Concepts, tools, and techniques for scaling business intelligence on the data lakehouse
Gupta, Vihag
- Publisher: Packt Publishing
- Publication date: 2022-09-16
- Price: $1,940
- VIP price: 5% off, $1,843
- Language: English
- Pages: 348
- Binding: Quality Paper - also called trade paper
- ISBN: 1803235330
- ISBN-13: 9781803235332
Related categories:
SQL
Overseas special-order books (separate checkout required)
Product Description
Master critical skills needed to deploy and use Databricks SQL and elevate your BI from the warehouse to the lakehouse with confidence
Key Features:
- Learn about business intelligence on the lakehouse with features and functions of Databricks SQL
- Make the most of Databricks SQL by getting to grips with the enablers of its data warehousing capabilities
- A unique approach to teaching concepts and techniques with follow-along scenarios on real datasets
Book Description:
In this new era of data platform system design, data lakes and data warehouses are giving way to the lakehouse - a new type of data platform system that aims to unify all data analytics into a single platform. Databricks, with its Databricks SQL product suite, is the hottest lakehouse platform out there, harnessing the power of Apache Spark(TM), Delta Lake, and other innovations to enable data warehousing capabilities on the lakehouse with data lake economics.
This book is a comprehensive hands-on guide that helps you explore all the advanced features, use cases, and technology components of Databricks SQL. You'll start with the lakehouse architecture fundamentals and understand how Databricks SQL fits into it. The book then shows you how to use the platform, from exploring data, executing queries, building reports, and using dashboards through to learning the administrative aspects of the lakehouse - data security, governance, and management of the computational power of the lakehouse. You'll also delve into the core technology enablers of Databricks SQL - Delta Lake and Photon. Finally, you'll get hands-on with advanced SQL commands for ingesting data and maintaining the lakehouse.
By the end of this book, you'll have mastered Databricks SQL and be able to deploy and deliver fast, scalable business intelligence on the lakehouse.
What You Will Learn:
- Understand how Databricks SQL fits into the Databricks Lakehouse Platform
- Perform everyday analytics with Databricks SQL Workbench and business intelligence tools
- Organize and catalog your data assets
- Program the data security model to protect and govern your data
- Tune SQL warehouses (computing clusters) for optimal query experience
- Tune the Delta Lake storage format for maximum query performance
- Deliver extreme performance with the Photon query execution engine
- Implement advanced data ingestion patterns with Databricks SQL
Who this book is for:
This book is for business intelligence practitioners, data warehouse administrators, and data engineers who are new to Databricks SQL and want to learn how to deliver high-quality insights unhindered by the scale of data or infrastructure. This book is also for anyone looking to study the advanced technologies that power Databricks SQL. Basic knowledge of data warehouses, SQL-based analytics, and ETL processes is recommended to effectively learn the concepts introduced in this book and appreciate the innovation behind the platform.