Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2 (Paperback)

Arun Murthy, Vinod Vavilapalli, Douglas Eadline, Joseph Niemiec, Jeff Markham

  • 出版商: Addison Wesley
  • 出版日期: 2014-03-29
  • 定價: $1,395
  • 售價: 8.0$1,116
  • 語言: 英文
  • 頁數: 400
  • 裝訂: Paperback
  • ISBN: 0321934504
  • ISBN-13: 9780321934505
  • 相關分類: Hadoop分散式架構
  • 立即出貨

買這商品的人也買了...

相關主題

商品描述

“This book is a critically needed resource for the newly released Apache Hadoop 2.0, highlighting YARN as the significant breakthrough that broadens Hadoop beyond the MapReduce paradigm.”
—From the Foreword by Raymie Stata, CEO of Altiscale


The Insider’s Guide to Building Distributed, Big Data Applications with Apache Hadoop™ YARN

 

Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop™ YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revolutionary advances.

 

YARN project founder Arun Murthy and project lead Vinod Kumar Vavilapalli demonstrate how YARN increases scalability and cluster utilization, enables new programming models and services, and opens new options beyond Java and batch processing. They walk you through the entire YARN project lifecycle, from installation through deployment.

 

You’ll find many examples drawn from the authors’ cutting-edge experience—first as Hadoop’s earliest developers and implementers at Yahoo! and now as Hortonworks developers moving the platform forward and helping customers succeed with it.

 

Coverage includes

  • YARN’s goals, design, architecture, and components—how it expands the Apache Hadoop ecosystem
  • Exploring YARN on a single node 
  • Administering YARN clusters and Capacity Scheduler 
  • Running existing MapReduce applications 
  • Developing a large-scale clustered YARN application 
  • Discovering new open source frameworks that run under YARN

商品描述(中文翻譯)

「這本書是針對新釋出的Apache Hadoop 2.0所需的重要資源,強調YARN作為重大突破,將Hadoop擴展到MapReduce範式之外。」
—來自Altiscale的CEO Raymie Stata的前言

「內幕指南:使用Apache Hadoop™ YARN建立分散式大數據應用程式」

Apache Hadoop正在推動大數據革命。現在,它的數據處理已經完全改進:Apache Hadoop YARN在數據中心規模上提供資源管理,並提供更簡單的方式來創建處理PB級數據的分散式應用程式。現在,在《Apache Hadoop™ YARN》中,兩位Hadoop技術領導者將向您展示如何開發新應用程式並適應現有程式碼,以充分利用這些革命性的進展。

YARN項目的創始人Arun Murthy和項目負責人Vinod Kumar Vavilapalli展示了YARN如何提高可擴展性和叢集利用率,實現新的編程模型和服務,並在Java和批處理之外提供新的選擇。他們將引導您完成整個YARN項目的生命周期,從安裝到部署。

您將找到許多來自作者的實踐經驗的例子,他們首先是Hadoop的最早開發者和實施者,現在是Hortonworks的開發人員,推動平台發展並幫助客戶成功。

內容包括:
- YARN的目標、設計、架構和組件,以及它如何擴展Apache Hadoop生態系統
- 在單個節點上探索YARN
- 管理YARN叢集和容量調度器
- 執行現有的MapReduce應用程式
- 開發大規模叢集化的YARN應用程式
- 探索在YARN下運行的新開源框架