HDInsight Essentials, 2/e (Paperback)
暫譯: HDInsight 基礎知識,第二版(平裝本)
Rajesh Nadipalli
- 出版商: Packt Publishing
- 出版日期: 2015-01-29
- 售價: $1,660
- 貴賓價: 9.5 折 $1,577
- 語言: 英文
- 頁數: 150
- 裝訂: Paperback
- ISBN: 1784399426
- ISBN-13: 9781784399429
海外代購書籍(需單獨結帳)
相關主題
商品描述
Learn how to build and deploy a modern big data architecture to empower your business
About This Book
- Learn how to quickly provision a Hadoop cluster using Windows Azure Cloud Services
- Build an end-to-end application for a big data problem using open source software
- Discover more about modern data architecture with this guide, to help you understand the transition from legacy relational Enterprise Data Warehouse
Who This Book Is For
If you want to discover one of the latest tools designed to produce stunning Big Data insights, this book features everything you need to get to grips with your data. Whether you are a data architect, developer, or a business strategist, HDInsight adds value in everything from development, administration, and reporting.
What You Will Learn
- Explore core features of Hadoop, including the HDFS2 and YARN, the new resource manager for Hadoop
- Build your HDInsight cluster in minutes and learn how to administer it using Azure PowerShell
- Discover what's new in Hadoop 2.X and the reference architecture for a modern data lake based on Hadoop
- Find out more about a data lake vision and its core capabilities
- Ingest and organize your data into HDInsight
- Utilize open source software to transform data including Hive, Pig, and MapReduce, and make it available for decision makers
- Get to grips with architectural considerations for scalability, maintainability, and security
In Detail
Traditional relational databases are today ineffective with dealing with the challenges presented by Big Data. A Hadoop-based architecture offers a radical solution, as it is designed specifically to handle huge sets of unstructured data.
This book takes you through the journey of building a modern data lake architecture using HDInsight, a Hadoop-based service that allows you to successfully manage high volume and velocity data in the Microsoft Azure Cloud. Featuring a wealth of practical examples, you'll find tips and techniques to provision your own HDInsight cluster to ingest, organize, transform, and analyze data.
While guided through HDInsight, you'll explore the wider Hadoop ecosystem with plenty of working examples on Hadoop technologies including Hive, Pig, MapReduce, HBase, Storm, and analytics solutions including using Excel PowerQuery, PowerMap, and PowerBI.
商品描述(中文翻譯)
**學習如何建立和部署現代大數據架構,以提升您的業務**
## 本書介紹
- 學習如何快速配置使用 Windows Azure Cloud Services 的 Hadoop 叢集
- 使用開源軟體為大數據問題構建端到端應用程式
- 通過本指南深入了解現代數據架構,幫助您理解從傳統關聯企業數據倉庫的轉變
## 本書適合誰
如果您想發現最新的工具,以產生驚人的大數據洞察,本書提供了您掌握數據所需的一切。無論您是數據架構師、開發人員還是商業策略師,HDInsight 在開發、管理和報告等各方面都能增值。
## 您將學到什麼
- 探索 Hadoop 的核心功能,包括 HDFS2 和 YARN,Hadoop 的新資源管理器
- 在幾分鐘內建立您的 HDInsight 叢集,並學習如何使用 Azure PowerShell 進行管理
- 發現 Hadoop 2.X 的新特性以及基於 Hadoop 的現代數據湖參考架構
- 了解數據湖的願景及其核心能力
- 將數據導入並組織到 HDInsight 中
- 利用開源軟體轉換數據,包括 Hive、Pig 和 MapReduce,並使其可供決策者使用
- 理解可擴展性、可維護性和安全性的架構考量
## 詳細內容
傳統的關聯數據庫在處理大數據所帶來的挑戰時已經變得無效。基於 Hadoop 的架構提供了一個根本性的解決方案,因為它專門設計用來處理大量的非結構化數據。
本書將帶您走過使用 HDInsight 建立現代數據湖架構的旅程,HDInsight 是一項基於 Hadoop 的服務,允許您在 Microsoft Azure Cloud 中成功管理高容量和高速度的數據。書中包含大量實用範例,您將找到配置自己的 HDInsight 叢集以導入、組織、轉換和分析數據的技巧和技術。
在引導您使用 HDInsight 的同時,您將探索更廣泛的 Hadoop 生態系統,並提供許多有關 Hadoop 技術的實作範例,包括 Hive、Pig、MapReduce、HBase、Storm,以及使用 Excel PowerQuery、PowerMap 和 PowerBI 的分析解決方案。