Hadoop: The Definitive Guide (Paperback)
暫譯: Hadoop:權威指南(平裝本)

Tom White

  • 出版商: O'Reilly
  • 出版日期: 2009-06-15
  • 售價: $1,740
  • 貴賓價: 9.5$1,653
  • 語言: 英文
  • 頁數: 528
  • 裝訂: Paperback
  • ISBN: 0596521979
  • ISBN-13: 9780596521974
  • 相關分類: Hadoop
  • 已過版

買這商品的人也買了...

商品描述

Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters.

Complete with case studies that illustrate how Hadoop solves specific problems, this book helps you:

  • Use the Hadoop Distributed File System (HDFS) for storing large datasets, and run distributed computations over those datasets using MapReduce
  • Become familiar with Hadoop's data and I/O building blocks for compression, data integrity, serialization, and persistence
  • Discover common pitfalls and advanced features for writing real-world MapReduce programs
  • Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud
  • Use Pig, a high-level query language for large-scale data processing
  • Take advantage of HBase, Hadoop's database for structured and semi-structured data
  • Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems

If you have lots of data -- whether it's gigabytes or petabytes -- Hadoop is the perfect solution. Hadoop: The Definitive Guide is the most thorough book available on the subject.

"Now you have the opportunity to learn about Hadoop from a master-not only of the technology, but also of common sense and plain talk." -- Doug Cutting, Hadoop Founder, Yahoo!

商品描述(中文翻譯)

《Hadoop: The Definitive Guide》幫助您充分利用數據的力量。Apache Hadoop 框架是處理大型數據集的理想選擇,它是 MapReduce 演算法的開源實現,Google 正是基於此建立了其帝國。本書是一本全面的資源,展示如何使用 Hadoop 構建可靠、可擴展的分散式系統:程式設計師將找到分析大型數據集的詳細資訊,而管理員將學習如何設置和運行 Hadoop 集群。

本書包含案例研究,說明 Hadoop 如何解決特定問題,幫助您:

- 使用 Hadoop 分散式檔案系統 (HDFS) 存儲大型數據集,並使用 MapReduce 在這些數據集上運行分散式計算
- 熟悉 Hadoop 的數據和 I/O 基礎組件,用於壓縮、數據完整性、序列化和持久性
- 發現撰寫實際 MapReduce 程式的常見陷阱和進階功能
- 設計、構建和管理專用的 Hadoop 集群,或在雲端運行 Hadoop
- 使用 Pig,這是一種用於大規模數據處理的高級查詢語言
- 利用 HBase,Hadoop 的結構化和半結構化數據數據庫
- 學習 ZooKeeper,這是一套用於構建分散式系統的協調原語工具包

如果您擁有大量數據——無論是千兆字節還是拍字節——Hadoop 是完美的解決方案。《Hadoop: The Definitive Guide》是該主題上最全面的書籍。

「現在您有機會向一位大師學習 Hadoop——不僅是技術方面,還有常識和簡單明瞭的表達。」—— Doug Cutting, Hadoop 創始人, Yahoo!