Hadoop Essentials: A Quantitative Approach (Paperback)
暫譯: Hadoop 基礎:量化方法論 (平裝本)

Henry H. Liu

  • 出版商: CreateSpace Independ
  • 出版日期: 2012-10-31
  • 售價: $1,750
  • 貴賓價: 9.5$1,663
  • 語言: 英文
  • 頁數: 356
  • 裝訂: Paperback
  • ISBN: 1480216372
  • ISBN-13: 9781480216372
  • 相關分類: Hadoop
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

This textbook adopts a unique approach to helping developers and CS students learn Hadoop MapReduce programming fast. Rather than filled with disjointed, piecemeal code snippets to show Hadoop MapReduce programming features one at a time, it is designed to place your total Hadoop MapReduce programming learning process in a common application context of mining customer spending patterns ensconced in large volumes of credit card transaction record data. Precise, end-to-end procedures are given to help you set up your Hadoop MapReduce development environment quickly on Eclipse with Maven on Mac OS X or Linux. Step-by-step procedures are also given on how to set up a four-node Linux cluster at minimum so that you can run your MapReduce programs not only in local mode but also in fully distributed mode on a real cluster. In fact, all MapReduce programs presented in the book have been tested and verified in local mode and on such a Linux cluster.

This textbook mainly focuses on teaching Hadoop MapReduce programming in a scientific, objective, quantitative approach. Rather than heavily relying on subjective, verbose (and sometimes even pompous) textual descriptions with sparse code snippets, this textbook uses Hadoop Java APIs, Hadoop configuration parameters, complete MapReduce programs and their execution logs and outputs to demonstrate how Hadoop MapReduce framework works and how to write MapReduce programs. Specifically, this text covers the following subjects:

* Introduction to Hadoop
* Setting up a Linux Hadoop Cluster
* The Hadoop Distributed FileSystem
* MapReduce Job Orchestration and Workflows
* Basic MapReduce Programming
* Advanced MapReduce Programming
* Hadoop Streaming
* Hadoop Administration

No matter what role you play on your team, this text can help you gain truly applicable Hadoop skills in a most effective and efficient manner. The book can also be used as a supplementary textbook for a distributed computing or Hadoop course offered to upper-division college CS students.

商品描述(中文翻譯)

這本教科書採用獨特的方法來幫助開發者和計算機科學學生快速學習 Hadoop MapReduce 程式設計。與其用零散的、片段式的程式碼片段逐一展示 Hadoop MapReduce 的程式設計特性,不如將整個 Hadoop MapReduce 程式設計的學習過程置於一個共同的應用背景中,即挖掘客戶消費模式,這些模式隱藏在大量的信用卡交易記錄數據中。提供精確的端到端程序,幫助您在 Mac OS X 或 Linux 上快速設置 Hadoop MapReduce 開發環境,使用 Eclipse 和 Maven。還提供逐步程序,說明如何至少設置一個四節點的 Linux 集群,以便您可以在本地模式和真正的集群上以完全分散的模式運行 MapReduce 程式。事實上,本書中呈現的所有 MapReduce 程式都已在本地模式和這樣的 Linux 集群上進行測試和驗證。

這本教科書主要專注於以科學、客觀、定量的方法教授 Hadoop MapReduce 程式設計。與其過度依賴主觀、冗長(有時甚至自命不凡)的文字描述和稀疏的程式碼片段,這本教科書使用 Hadoop Java API、Hadoop 配置參數、完整的 MapReduce 程式及其執行日誌和輸出,來展示 Hadoop MapReduce 框架的運作方式以及如何編寫 MapReduce 程式。具體來說,這本書涵蓋以下主題:

* Hadoop 介紹
* 設置 Linux Hadoop 集群
* Hadoop 分散式檔案系統
* MapReduce 工作協調和工作流程
* 基本 MapReduce 程式設計
* 進階 MapReduce 程式設計
* Hadoop Streaming
* Hadoop 管理

無論您在團隊中扮演什麼角色,這本書都能幫助您以最有效和高效的方式獲得真正可應用的 Hadoop 技能。這本書也可以作為針對高年級大學計算機科學學生的分散式計算或 Hadoop 課程的補充教材。