Polybase Revealed: Data Virtualization with SQL Server, Hadoop, Apache Spark, and Beyond (Paperback)
暫譯: Polybase 揭密:使用 SQL Server、Hadoop、Apache Spark 等進行資料虛擬化 (平裝本)

Feasel, Kevin

  • 出版商: Apress
  • 出版日期: 2019-12-21
  • 售價: $1,100
  • 貴賓價: 9.5$1,045
  • 語言: 英文
  • 頁數: 311
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1484254600
  • ISBN-13: 9781484254608
  • 相關分類: HadoopMSSQLSparkSQL
  • 立即出貨 (庫存=1)

買這商品的人也買了...

相關主題

商品描述

Harness the power of PolyBase data virtualization software to make data from a variety of sources easily accessible through SQL queries while using the T-SQL skills you already know and have mastered.

PolyBase Revealed shows you how to use the PolyBase feature of SQL Server 2019 to integrate SQL Server with Azure Blob Storage, Apache Hadoop, other SQL Server instances, Oracle, Cosmos DB, Apache Spark, and more. You will learn how PolyBase can help you reduce storage and other costs by avoiding the need for ETL processes that duplicate data in order to make it accessible from one source. PolyBase makes SQL Server into that one source, and T-SQL is your golden ticket. The book also covers PolyBase scale-out clusters, allowing you to distribute PolyBase queries among several SQL Server instances, thus improving performance.

With great flexibility comes great complexity, and this book shows you where to look when queries fail, complete with coverage of internals, troubleshooting techniques, and where to find more information on obscure cross-platform errors. Data virtualization is a key target for Microsoft with SQL Server 2019. This book will help you keep your skills current, remain relevant, and build new business and career opportunities around Microsoft’s product direction.

 

What You Will Learn

  • Install and configure PolyBase as a stand-alone service, or unlock its capabilities with a scale-out cluster
  • Understand how PolyBase interacts with outside data sources while presenting their data as regular SQL Server tables
  • Write queries combining data from SQL Server, Apache Hadoop, Oracle, Cosmos DB, Apache Spark, and more
  • Troubleshoot PolyBase queries using SQL Server Dynamic Management Views
  • Tune PolyBase queries using statistics and execution plans
  • Solve common business problems, including "cold storage" of infrequently accessed data and simplifying ETL jobs

商品描述(中文翻譯)

利用 PolyBase 數據虛擬化軟體的強大功能,通過您已經熟悉並掌握的 T-SQL 技能,輕鬆地通過 SQL 查詢訪問來自各種來源的數據。

PolyBase 揭密 向您展示如何使用 SQL Server 2019 的 PolyBase 功能,將 SQL Server 與 Azure Blob Storage、Apache Hadoop、其他 SQL Server 實例、Oracle、Cosmos DB、Apache Spark 等進行整合。您將學習到 PolyBase 如何幫助您降低存儲和其他成本,避免需要 ETL 流程來重複數據,以便從一個來源訪問數據。PolyBase 將 SQL Server 變成那個唯一的來源,而 T-SQL 是您的金鑰。這本書還涵蓋了 PolyBase 擴展集群,讓您能夠在多個 SQL Server 實例之間分配 PolyBase 查詢,從而提高性能。

隨著極大的靈活性而來的是極大的複雜性,這本書將告訴您在查詢失敗時應該查看的地方,並詳細介紹內部結構、故障排除技術,以及在何處找到有關不明跨平台錯誤的更多信息。數據虛擬化是微軟在 SQL Server 2019 中的一個關鍵目標。這本書將幫助您保持技能的現代性,保持相關性,並圍繞微軟的產品方向建立新的商業和職業機會。

 

您將學到什麼


  • 安裝和配置 PolyBase 作為獨立服務,或通過擴展集群解鎖其功能

  • 了解 PolyBase 如何與外部數據來源互動,同時將其數據呈現為常規的 SQL Server 表

  • 撰寫查詢,結合來自 SQL Server、Apache Hadoop、Oracle、Cosmos DB、Apache Spark 等的數據

  • 使用 SQL Server 動態管理視圖排除 PolyBase 查詢的故障

  • 使用統計信息和執行計劃調整 PolyBase 查詢

  • 解決常見的商業問題,包括對不常訪問數據的「冷存儲」以及簡化 ETL 工作

作者簡介

Kevin Feasel is a Microsoft Data Platform MVP and CTO at Envizage where he specializes in T-SQL and R development, forcing Spark clusters to do his bidding, fighting with Kafka, and pulling rabbits out of hats on demand. He is the lead curator at Curated SQL (curatedsql.com). A resident of Durham, North Carolina, USA, Kevin can be found cycling the trails along the Triangle whenever the weather is nice enough.

 

 

 

作者簡介(中文翻譯)

Kevin Feasel 是微軟數據平台的 MVP 及 Envizage 的首席技術官,他專注於 T-SQL 和 R 開發,讓 Spark 集群聽從他的指揮,與 Kafka 進行鬥爭,並隨時變出驚喜。他是 Curated SQL (curatedsql.com) 的首席策展人。Kevin 住在美國北卡羅來納州的達勒姆,當天氣好時,他經常在三角地區的步道上騎自行車。