Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem (Paperback)

Douglas Eadline

  • Publisher: Addison Wesley
  • Publication date: 2015-11-05
  • List price: $1,280
  • VIP price: $1,216 (5% off)
  • Language: English
  • Pages: 304
  • Binding: Paperback
  • ISBN: 0134049942
  • ISBN-13: 9780134049946
  • Related categories: Hadoop, Big Data

Product Description

Get Started Fast with Apache Hadoop® 2, YARN, and Today’s Hadoop Ecosystem

With Hadoop 2.x and YARN, Hadoop moves beyond MapReduce to become practical for virtually any type of data processing. Hadoop 2.x and the Data Lake concept represent a radical shift away from conventional approaches to data usage and storage. Hadoop 2.x installations offer unmatched scalability and breakthrough extensibility that supports new and existing Big Data analytics processing methods and models.

Hadoop® 2 Quick-Start Guide is the first easy, accessible guide to Apache Hadoop 2.x, YARN, and the modern Hadoop ecosystem. Building on his unsurpassed experience teaching Hadoop and Big Data, author Douglas Eadline covers all the basics you need to know to install and use Hadoop 2 on personal computers or servers, and to navigate the powerful technologies that complement it.

Eadline concisely introduces and explains every key Hadoop 2 concept, tool, and service, illustrating each with a simple “beginning-to-end” example and identifying trustworthy, up-to-date resources for learning more.

This guide is ideal if you want to learn about Hadoop 2 without getting mired in technical details. Douglas Eadline will bring you up to speed quickly, whether you’re a user, admin, devops specialist, programmer, architect, analyst, or data scientist.

Coverage Includes

  • Understanding what Hadoop 2 and YARN do, and how they improve on Hadoop 1 with MapReduce
  • Understanding Hadoop-based Data Lakes versus RDBMS Data Warehouses
  • Installing Hadoop 2 and core services on Linux machines, virtualized sandboxes, or clusters
  • Exploring the Hadoop Distributed File System (HDFS)
  • Understanding the essentials of MapReduce and YARN application programming
  • Simplifying programming and data movement with Apache Pig, Hive, Sqoop, Flume, Oozie, and HBase
  • Observing application progress, controlling jobs, and managing workflows
  • Managing Hadoop efficiently with Apache Ambari, including recipes for the HDFS to NFSv3 gateway, HDFS snapshots, and YARN configuration
  • Learning basic Hadoop 2 troubleshooting, and installing Apache Hue and Apache Spark
