Mastering Hadoop
暫譯: 精通Hadoop

Sandeep Karanth

  • 出版商: Packt Publishing
  • 出版日期: 2014-12-31
  • 售價: $2,220
  • 貴賓價: 9.5$2,109
  • 語言: 英文
  • 頁數: 398
  • 裝訂: Paperback
  • ISBN: 1783983647
  • ISBN-13: 9781783983643
  • 相關分類: Hadoop
  • 海外代購書籍(需單獨結帳)

商品描述

Go beyond the basics and master the next generation of Hadoop data processing platforms

About This Book

  • Learn how to optimize Hadoop MapReduce, Pig and Hive
  • Dive into YARN and learn how it can integrate Storm with Hadoop
  • Understand how Hadoop can be deployed on the cloud and gain insights into analytics with Hadoop

Who This Book Is For

Do you want to broaden your Hadoop skill set and take your knowledge to the next level? Do you wish to enhance your knowledge of Hadoop to solve challenging data processing problems? Are your Hadoop jobs, Pig scripts, or Hive queries not working as fast as you intend? Are you looking to understand the benefits of upgrading Hadoop? If the answer is yes to any of these, this book is for you. It assumes novice-level familiarity with Hadoop.

What You Will Learn

  • Understand the changes involved in the process in the move from Hadoop 1.0 to Hadoop 2.0
  • Customize and optimize MapReduce jobs in Hadoop 2.0
  • Explore Hadoop I/O and different data formats
  • Dive into YARN and Storm and use YARN to integrate Storm with Hadoop
  • Deploy Hadoop on Amazon Elastic MapReduce
  • Discover HDFS replacements and learn about HDFS Federation
  • Get to grips with Hadoop's main security aspects
  • Utilize Mahout and RHadoop for Hadoop analytics

In Detail

Hadoop is synonymous with Big Data processing. Its simple programming model, "code once and deploy at any scale" paradigm, and an ever-growing ecosystem makes Hadoop an all-encompassing platform for programmers with different levels of expertise.

This book explores the industry guidelines to optimize MapReduce jobs and higher-level abstractions such as Pig and Hive in Hadoop 2.0. Then, it dives deep into Hadoop 2.0 specific features such as YARN and HDFS Federation.

This book is a step-by-step guide that focuses on advanced Hadoop concepts and aims to take your Hadoop knowledge and skill set to the next level. The data processing flow dictates the order of the concepts in each chapter, and each chapter is illustrated with code fragments or schematic diagrams.

商品描述(中文翻譯)

超越基礎,掌握下一代 Hadoop 數據處理平台

本書介紹


  • 學習如何優化 Hadoop MapReduce、Pig 和 Hive

  • 深入了解 YARN,學習如何將 Storm 與 Hadoop 整合

  • 了解如何在雲端部署 Hadoop,並獲得有關 Hadoop 分析的見解

本書適合誰

您是否想擴展您的 Hadoop 技能,並將知識提升到更高的層次?您是否希望增強對 Hadoop 的了解,以解決具有挑戰性的數據處理問題?您的 Hadoop 作業、Pig 腳本或 Hive 查詢是否沒有如您所預期的那樣快速運行?您是否希望了解升級 Hadoop 的好處?如果以上問題的答案是肯定的,那麼這本書就是為您而寫。它假設讀者對 Hadoop 有初學者級別的熟悉度。

您將學到什麼

  • 了解從 Hadoop 1.0 到 Hadoop 2.0 過程中涉及的變化
  • 自定義和優化 Hadoop 2.0 中的 MapReduce 作業
  • 探索 Hadoop I/O 和不同的數據格式
  • 深入了解 YARN 和 Storm,並使用 YARN 將 Storm 與 Hadoop 整合
  • 在 Amazon Elastic MapReduce 上部署 Hadoop
  • 發現 HDFS 的替代方案並了解 HDFS Federation
  • 掌握 Hadoop 的主要安全方面
  • 利用 Mahout 和 RHadoop 進行 Hadoop 分析

詳細內容

Hadoop 與大數據處理同義。其簡單的編程模型、「一次編碼,隨時部署」的範式,以及不斷增長的生態系統,使 Hadoop 成為不同專業水平的程序員的全方位平台。

本書探討了優化 MapReduce 作業和更高層次抽象(如 Pig 和 Hive)在 Hadoop 2.0 中的行業指導方針。然後,深入研究 Hadoop 2.0 的特定功能,如 YARN 和 HDFS Federation。

本書是一個逐步指南,專注於高級 Hadoop 概念,旨在將您的 Hadoop 知識和技能提升到更高的層次。數據處理流程決定了每章概念的順序,每章都用代碼片段或示意圖進行說明。