Product Description
Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure platform. The information in this book enables you to process enormous volumes of structured and unstructured data easily using HDInsight, Microsoft’s own distribution of Apache Hadoop. Furthermore, the blend of Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) offerings available through Windows Azure lets you take advantage of Hadoop’s processing power without the worry of creating, configuring, maintaining, or managing your own cluster.
With a data explosion on the horizon, the open source Apache Hadoop framework is gaining traction, and it benefits from a huge ecosystem that has grown up around the core functionality of the Hadoop Distributed File System (HDFS™) and Hadoop MapReduce. Pro Microsoft HDInsight equips you with the knowledge, confidence, and technique to configure and manage this ecosystem on Windows Azure. The book is an excellent choice for anyone aspiring to be a data scientist or data engineer, putting you a step ahead in the data mining field.
- Guides you through installation and configuration of an HDInsight cluster on Windows Azure
- Provides clear examples of configuring and executing MapReduce jobs
- Helps you consume data and diagnose errors from the Windows Azure HDInsight Service