Microsoft SQL Server 2012 with Hadoop
暫譯: Microsoft SQL Server 2012 與 Hadoop 整合
Debarchan Sarkar
- 出版商: Packt Publishing
- 出版日期: 2013-08-10
- 售價: $1,840
- 貴賓價: 9.5 折 $1,748
- 語言: 英文
- 頁數: 96
- 裝訂: Paperback
- ISBN: 1782177981
- ISBN-13: 9781782177982
-
相關分類:
Hadoop、MSSQL、SQL
海外代購書籍(需單獨結帳)
商品描述
Getting SQL Server talking to Hadoop is a smooth process when you follow this tutorial. Learn all the tools and techniques you need integrate the data and then extract powerful business insights from the merged result.
Overview
- Integrate data from unstructured (Hadoop) and structured (SQL Server 2012) sources
- Configure and install connectors for a bi-directional transfer of data
- Full of illustrations, diagrams, and tips with clear, step-by-step instructions and practical examples
In Detail
With the explosion of data, the open source Apache Hadoop ecosystem is gaining traction, thanks to its huge ecosystem that has arisen around the core functionalities of its distributed file system (HDFS) and Map Reduce. As of today, being able to have SQL Server talking to Hadoop has become increasingly important because the two are indeed complementary. While petabytes of unstructured data can be stored in Hadoop taking hours to be queried, terabytes of structured data can be stored in SQL Server 2012 and queried in seconds. This leads to the need to transfer and integrate data between Hadoop and SQL Server.
Microsoft SQL Server 2012 with Hadoop is aimed at SQL Server developers. It will quickly show you how to get Hadoop activated on SQL Server 2012 (it ships with this version). Once this is done, the book will focus on how to manage big data with Hadoop and use Hadoop Hive to query the data. It will also cover topics such as using in-memory functions by SQL Server and using tools for BI with big data.
Microsoft SQL Server 2012 with Hadoop focuses on data integration techniques between relational (SQL Server 2012) and non-relational (Hadoop) worlds. It will walk you through different tools for the bi-directional movement of data with practical examples.
You will learn to use open source connectors like SQOOP to import and export data between SQL Server 2012 and Hadoop, and to work with leading in-memory BI tools to create ETL solutions using the Hive ODBC driver for developing your data movement projects. Finally, this book will give you a glimpse of the present day self-service BI tools such as Excel and PowerView to consume Hadoop data and provide powerful insights on the data.
What you will learn from this book
- Use the Native SQOOP Connector for data movement between SQL Server 2012 and Hadoop
- Configure and use the Hive ODBC driver to enable any ODBC compliant client to consume Hadoop data
- Create ETL solutions and automate data movement jobs between SQL Server 2012 and Hadoop using SQL Server Integration Services
- Provide powerful reporting on the integrated data with just a matter of a few clicks using Microsoft self-service BI tools
- Merge structured and unstructured data together in a common warehouse for analysis, which is essential
Approach
This book will be a step-by-step tutorial, which practically teaches working with big data on SQL Server through sample examples in increasing complexity.
Who this book is written for
Microsoft SQL Server 2012 with Hadoop is specifically targeted at readers who want to cross-pollinate their Hadoop skills with SQL Server 2012 business intelligence and data analytics. A basic understanding of traditional RDBMS technologies and query processing techniques is essential.
商品描述(中文翻譯)
將 SQL Server 與 Hadoop 連接是一個順利的過程,只要您遵循本教程。學習您需要的所有工具和技術,以整合數據,然後從合併的結果中提取強大的商業洞察。
**概述**
- 整合來自非結構化(Hadoop)和結構化(SQL Server 2012)來源的數據
- 配置和安裝連接器以實現雙向數據傳輸
- 充滿插圖、圖表和提示,提供清晰的逐步指導和實用範例
**詳細內容**
隨著數據的爆炸性增長,開源的 Apache Hadoop 生態系統正逐漸受到重視,這要歸功於圍繞其分佈式檔案系統(HDFS)和 Map Reduce 的核心功能所形成的龐大生態系統。到目前為止,能夠讓 SQL Server 與 Hadoop 進行交互變得越來越重要,因為這兩者確實是互補的。雖然可以在 Hadoop 中存儲數 PB 的非結構化數據,查詢需要幾個小時,但可以在 SQL Server 2012 中存儲數 TB 的結構化數據,並在幾秒鐘內查詢。這導致了在 Hadoop 和 SQL Server 之間傳輸和整合數據的需求。
《Microsoft SQL Server 2012 與 Hadoop》針對 SQL Server 開發人員。它將快速向您展示如何在 SQL Server 2012 上啟用 Hadoop(此版本隨附此功能)。完成後,本書將重點介紹如何使用 Hadoop 管理大數據,並使用 Hadoop Hive 查詢數據。它還將涵蓋使用 SQL Server 的內存功能和使用大數據的 BI 工具等主題。
《Microsoft SQL Server 2012 與 Hadoop》專注於關聯(SQL Server 2012)和非關聯(Hadoop)世界之間的數據整合技術。它將引導您通過不同的工具進行雙向數據移動,並提供實用範例。
您將學會使用開源連接器如 SQOOP 在 SQL Server 2012 和 Hadoop 之間進行數據的導入和導出,並與領先的內存 BI 工具合作,使用 Hive ODBC 驅動程序創建 ETL 解決方案,以開發您的數據移動項目。最後,本書將讓您一窺當今的自助式 BI 工具,如 Excel 和 PowerView,以消費 Hadoop 數據並提供強大的數據洞察。
**您將從本書中學到什麼**
- 使用原生 SQOOP 連接器在 SQL Server 2012 和 Hadoop 之間進行數據移動
- 配置和使用 Hive ODBC 驅動程序,使任何 ODBC 相容的客戶端能夠消費 Hadoop 數據
- 使用 SQL Server 整合服務創建 ETL 解決方案並自動化 SQL Server 2012 和 Hadoop 之間的數據移動作業
- 僅需幾次點擊,即可使用 Microsoft 自助式 BI 工具對整合數據提供強大的報告
- 將結構化和非結構化數據合併到一個共同的數據倉庫中進行分析,這是至關重要的
**方法**
本書將是一個逐步的教程,通過逐漸增加複雜性的範例,實際教授如何在 SQL Server 上處理大數據。
**本書的讀者**
《Microsoft SQL Server 2012 與 Hadoop》專門針對希望將其 Hadoop 技能與 SQL Server 2012 商業智能和數據分析相結合的讀者。對傳統 RDBMS 技術和查詢處理技術的基本理解是必需的。