Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 (Paperback)
暫譯: Apache Hadoop YARN:超越 MapReduce 與批次處理的 Apache Hadoop 2

Arun Murthy, Vinod Vavilapalli, Douglas Eadline, Joseph Niemiec, Jeff Markham

  • 出版商: Addison Wesley
  • 出版日期: 2014-03-29
  • 定價: $1,395
  • 售價: 8.0$1,116
  • 語言: 英文
  • 頁數: 400
  • 裝訂: Paperback
  • ISBN: 0321934504
  • ISBN-13: 9780321934505
  • 相關分類: Hadoop分散式架構
  • 立即出貨

買這商品的人也買了...

商品描述

“This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.”
—From the Foreword by Raymie Stata, CEO of Altiscale


The Insider’s Guide to Building Distributed, Big Data Applications with Apache Hadoop™ YARN

 

Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop™ YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revolutionary advances.

 

YARN project founder Arun Murthy and project lead Vinod Kumar Vavilapalli demonstrate how YARN increases scalability and cluster utilization, enables new programming models and services, and opens new options beyond Java and batch processing. They walk you through the entire YARN project lifecycle, from installation through deployment.

 

You’ll find many examples drawn from the authors’ cutting-edge experience—first as Hadoop’s earliest developers and implementers at Yahoo! and now as Hortonworks developers moving the platform forward and helping customers succeed with it.

 

Coverage includes

  • YARN’s goals, design, architecture, and components—how it expands the Apache Hadoop ecosystem
  • Exploring YARN on a single node 
  • Administering YARN clusters and Capacity Scheduler 
  • Running existing MapReduce applications 
  • Developing a large-scale clustered YARN application 
  • Discovering new open source frameworks that run under YARN

商品描述(中文翻譯)

「這本書是針對新發布的 Apache Hadoop 2.0 的一個關鍵資源,突顯了 YARN 作為一個重要的突破,將 Hadoop 擴展到超越 MapReduce 的範疇。」

—摘自 Altiscale 的 CEO Raymie Stata 的前言

Apache Hadoop™ YARN 建立分散式大數據應用的內部指南

Apache Hadoop 正在推動大數據革命。現在,它的數據處理已經完全改造:Apache Hadoop YARN 提供了數據中心規模的資源管理,以及更簡單的方式來創建處理 PB 級數據的分散式應用。而在 Apache Hadoop™ YARN 中,兩位 Hadoop 技術領導者將向您展示如何開發新應用並調整現有代碼,以充分利用這些革命性的進展。

YARN 項目創始人 Arun Murthy 和項目負責人 Vinod Kumar Vavilapalli 展示了 YARN 如何提高可擴展性和集群利用率,啟用新的編程模型和服務,並開啟超越 Java 和批處理的新選項。他們將帶您了解整個 YARN 項目生命周期,從安裝到部署。

您將發現許多來自作者前沿經驗的範例——他們最初是 Yahoo! 的 Hadoop 早期開發者和實施者,現在則是 Hortonworks 的開發者,推動平台向前發展並幫助客戶成功使用它。

內容涵蓋:

- YARN 的目標、設計、架構和組件——它如何擴展 Apache Hadoop 生態系統
- 在單節點上探索 YARN
- 管理 YARN 集群和容量調度器
- 運行現有的 MapReduce 應用
- 開發大規模集群 YARN 應用
- 發現運行在 YARN 下的新開源框架