Hadoop Beginner's Guide (Paperback)

Garry Turkington

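  • Publisher: Packt Publishing
  • Publication date: 2013-02-11
  • Price: $2,220
  • VIP price: $2,109 (5% off)
  • Language: English
  • Pages: 398
  • Binding: Paperback
  • ISBN: 1849517304
  • ISBN-13: 9781849517300
  • Related category: Hadoop
  • Overseas-order title (requires separate checkout)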

Product Description

Get your mountain of data under control with Hadoop. This guide requires no prior knowledge of the software or cloud services - just a willingness to learn the basics from this practical step-by-step tutorial.

Overview

  • Learn tools and techniques that let you approach big data with relish and not fear.
  • See how to build a complete infrastructure to handle your needs as your data grows.
  • Hands-on examples in each chapter give the big picture while also giving direct experience.

In Detail

Data is arriving faster than you can process it, and the overall volumes keep growing at a rate that keeps you awake at night. Hadoop can help you tame the data beast. Effective use of Hadoop, however, requires a mixture of programming, design, and system administration skills.

"Hadoop Beginner's Guide" removes the mystery from Hadoop, presenting Hadoop and related technologies with a focus on building working systems and getting the job done, using cloud services to do so when it makes sense. From basic concepts and initial setup through developing applications and keeping the system running as the data grows, the book gives the understanding needed to use Hadoop effectively to solve real-world problems.

Starting with the basics of installing and configuring Hadoop, the book explains how to develop applications, maintain the system, and use additional products to integrate with other systems.

While teaching different ways to develop applications to run on Hadoop, the book also covers tools such as Hive, Sqoop, and Flume that show how Hadoop can be integrated with relational databases and log collection.

In addition to examples on Hadoop clusters running on Ubuntu, the book covers the use of cloud services such as Amazon EC2 and Elastic MapReduce.

What you will learn from this book

  • The trends that led to Hadoop and cloud services, giving the background to know when to use the technology
  • Best practices for setup and configuration of Hadoop clusters, tailoring the system to the problem at hand
  • Developing applications to run on Hadoop with examples in Java and Ruby
  • How Amazon Web Services can be used to deliver a hosted Hadoop solution and how this differs from directly managed environments
  • Integration with relational databases, using Hive for SQL queries and Sqoop for data transfer
  • How Flume can collect data from multiple sources and deliver it to Hadoop for processing
  • What other projects and tools make up the broader Hadoop ecosystem and where to go next

Approach

As a Packt Beginner's Guide, the book is packed with clear step-by-step instructions for performing the most useful tasks, so you get up and running quickly and learn by doing.

Who this book is written for

This book assumes no existing experience with Hadoop or cloud services. It assumes familiarity with a programming language such as Java or Ruby, but gives you the needed background on the other topics.
