Hadoop Essentials - Tackling the Challenges of Big Data with Hadoop

Shiva Achari

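  • Publisher: Packt Publishing
  • Publication date: 2015-04-30
  • List price: $1,470
  • VIP price: $1,397 (5% off)
  • Language: English
  • Pages: 172
  • Binding: Paperback
  • ISBN: 1784396680
  • ISBN-13: 9781784396688
  • Related categories: Hadoop, Big Data
  • Overseas special-order title (must be checked out separately)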

Product Description

Key Features

  • Get to grips with the most powerful tools in the Hadoop ecosystem, including Storm and Spark
  • Learn everything you need to take control of Big Data
  • A fast-paced journey through the key features of Hadoop

Book Description

This book jumps into the world of Hadoop and its tools to help you learn how to use them effectively to optimize and improve the way you handle Big Data.

Starting with the fundamentals of Hadoop, including YARN, MapReduce, HDFS, and other vital elements of the Hadoop ecosystem, you will soon move on to topics such as MapReduce patterns, data management, and real-time data analysis using Hadoop. You will also explore a number of the leading data processing tools, including Hive and Pig, and learn how to use Sqoop and Flume, two of the most powerful technologies used for data ingestion. With further guidance on data streaming and real-time analytics with Storm and Spark, Hadoop Essentials is a reliable and relevant resource for anyone who understands the difficulties - and opportunities - presented by Big Data today.

With this guide, you'll develop your confidence with Hadoop, and be able to use the knowledge and skills you learn to successfully harness its unparalleled capabilities.

What you will learn

  • Get to grips with the fundamentals of Hadoop, and tools such as HDFS, MapReduce, and YARN
  • Learn how to use Hadoop for real-world Big Data projects
  • Improve the performance of your Big Data architecture
  • Find out how to get the most from data processing tools such as Hive and Pig
  • Learn how to unlock real-time Big Data analytics with Apache Spark

About the Author

Shiva Achari has more than 8 years of extensive industry experience and is currently working as a Big Data Architect consultant with companies such as Oracle and Teradata. Over the years, he has architected, designed, and developed multiple innovative and high-performance large-scale solutions, such as distributed systems, data centers, big data management tools, SaaS cloud applications, Internet applications, and Data Analytics solutions.

Table of Contents

  1. Introduction to Big Data and Hadoop
  2. Hadoop Ecosystem
  3. Pillars of Hadoop: HDFS, MapReduce, and YARN
  4. Data Access Components: Hive and Pig
  5. Storage Component: HBase
  6. Data Ingestion in Hadoop: Sqoop and Flume
  7. Streaming and Real-time Analysis: Storm and Spark

