商品描述
Dive deep into Big Data concepts, platforms, analytics and their applications using the power of Hadoop 3 About This Book * Leverage the power of Hadoop 3 to build effective big data analytics solutions on-premise and on cloud * Integrate Hadoop with other big data tools such as R, Python, Apache Spark and Apache Flink * Get deep insights from your Big Data using Hadoop 3 with the help of real-world examples Who This Book Is For If you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3's powerful features, this book is for you. If you're new to Big Data analytics, this book will also help you. A basic understanding of the Java programming language is required for this book. What You Will Learn * Explore the new features of Hadoop 3 along with HDFS, YARN and MapReduce. * Get well-versed with the analytical capabilities of Hadoop ecosystem using practical examples * Integrate Hadoop with R and Python for more efficient big data processing * Learn to use Hadoop with Apache Spark and Apache Flink for real-time data analytics * Setup a Hadoop cluster on AWS cloud * Perform Big Data Analytics on AWS using Elastic Map Reduce In Detail Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. This book shows you how to do just that, with the help of practical examples. You will start with getting a quick overview of the new features introduced in Hadoop 3 along with HDFS, MapReduce and YARN , and how they enable faster, more efficient big data processing. Further, you will learn how to integrate Hadoop with the open source tools such as Python and R to analyse and visualise data and to perform statistical computing on Big Data. The book will also show you how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing, and demonstrates how to use Hadoop to build analytics solutions on the cloud. Finally, you will learn to build an end to end pipeline to perform Big Data Analytics using practical use cases. By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform Big Data analytics and get insights from your Big Data without any hassle.
商品描述(中文翻譯)
深入探討大數據概念、平台、分析及其應用,利用 Hadoop 3 的強大功能
關於本書
* 利用 Hadoop 3 的強大功能,在本地和雲端構建有效的大數據分析解決方案
* 將 Hadoop 與其他大數據工具如 R、Python、Apache Spark 和 Apache Flink 整合
* 通過實際案例,深入了解如何使用 Hadoop 3 從大數據中獲取深刻見解
本書適合誰
如果您希望利用 Hadoop 3 的強大功能為您的企業或業務構建高效能的分析解決方案,那麼這本書適合您。如果您是大數據分析的新手,本書也將對您有所幫助。本書需要具備 Java 程式語言的基本理解。
您將學到什麼
* 探索 Hadoop 3 的新功能,以及 HDFS、YARN 和 MapReduce
* 通過實際範例熟悉 Hadoop 生態系統的分析能力
* 將 Hadoop 與 R 和 Python 整合,以提高大數據處理的效率
* 學習如何使用 Hadoop 與 Apache Spark 和 Apache Flink 進行實時數據分析
* 在 AWS 雲端上設置 Hadoop 集群
* 使用 Elastic Map Reduce 在 AWS 上執行大數據分析
詳細內容
Apache Hadoop 是最受歡迎的大數據處理平台,可以與眾多其他大數據工具結合,構建強大的分析解決方案。本書將向您展示如何做到這一點,並提供實際範例。您將首先快速了解 Hadoop 3 中引入的新功能,以及 HDFS、MapReduce 和 YARN,並了解它們如何實現更快、更高效的大數據處理。此外,您將學習如何將 Hadoop 與開源工具如 Python 和 R 整合,以分析和可視化數據,並在大數據上執行統計計算。本書還將展示如何使用 Hadoop 3 與 Apache Spark 和 Apache Flink 進行實時數據分析和流處理,並演示如何使用 Hadoop 在雲端構建分析解決方案。最後,您將學會構建端到端的管道,以實際案例執行大數據分析。到本書結束時,您將熟悉 Hadoop 生態系統的分析能力,能夠輕鬆構建強大的解決方案來執行大數據分析,並從您的大數據中獲取見解。