Hadoop Blueprints

Anurag Shrivastava, Tanmay Deshpande

  • 出版商: Packt Publishing
  • 出版日期: 2016-09-30
  • 售價: $1,980
  • 貴賓價: 9.5$1,881
  • 語言: 英文
  • 頁數: 316
  • 裝訂: Paperback
  • ISBN: 1783980303
  • ISBN-13: 9781783980307
  • 相關分類: Hadoop
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

Use Hadoop to solve business problems by learning from a rich set of real-life case studies

About This Book

  • Solve real-world business problems using Hadoop and other Big Data technologies
  • Build efficient data lakes in Hadoop, and develop systems for various business cases like improving marketing campaigns, fraud detection, and more
  • Power packed with six case studies to get you going with Hadoop for Business Intelligence

Who This Book Is For

If you are interested in building efficient business solutions using Hadoop, this is the book for you This book assumes that you have basic knowledge of Hadoop, Java, and any scripting language.

What You Will Learn

  • Learn about the evolution of Hadoop as the big data platform
  • Understand the basics of Hadoop architecture
  • Build a 360 degree view of your customer using Sqoop and Hive
  • Build and run classification models on Hadoop using BigML
  • Use Spark and Hadoop to build a fraud detection system
  • Develop a churn detection system using Java and MapReduce
  • Build an IoT-based data collection and visualization system
  • Get to grips with building a Hadoop-based Data Lake for large enterprises
  • Learn about the coexistence of NoSQL and In-Memory databases in the Hadoop ecosystem

In Detail

If you have a basic understanding of Hadoop and want to put your knowledge to use to build fantastic Big Data solutions for business, then this book is for you. Build six real-life, end-to-end solutions using the tools in the Hadoop ecosystem, and take your knowledge of Hadoop to the next level.

Start off by understanding various business problems which can be solved using Hadoop. You will also get acquainted with the common architectural patterns which are used to build Hadoop-based solutions. Build a 360-degree view of the customer by working with different types of data, and build an efficient fraud detection system for a financial institution. You will also develop a system in Hadoop to improve the effectiveness of marketing campaigns. Build a churn detection system for a telecom company, develop an Internet of Things (IoT) system to monitor the environment in a factory, and build a data lake all making use of the concepts and techniques mentioned in this book.

The book covers other technologies and frameworks like Apache Spark, Hive, Sqoop, and more, and how they can be used in conjunction with Hadoop. You will be able to try out the solutions explained in the book and use the knowledge gained to extend them further in your own problem space.

Style and approach

This is an example-driven book where each chapter covers a single business problem and describes its solution by explaining the structure of a dataset and tools required to process it. Every project is demonstrated with a step-by-step approach, and explained in a very easy-to-understand manner.

商品描述(中文翻譯)

使用Hadoop解決業務問題,通過學習豐富的實際案例研究。

關於本書
- 使用Hadoop和其他大數據技術解決現實世界的業務問題
- 在Hadoop中建立高效的數據湖,並為改進市場營銷活動、詐騙檢測等各種業務案例開發系統
- 包含六個案例研究,讓您開始使用Hadoop進行商業智能

本書適合對使用Hadoop構建高效業務解決方案感興趣的讀者。本書假設您具有Hadoop、Java和任何腳本語言的基本知識。

您將學到什麼
- 了解Hadoop作為大數據平台的演進
- 理解Hadoop架構的基礎知識
- 使用Sqoop和Hive構建客戶的360度視圖
- 使用BigML在Hadoop上構建和運行分類模型
- 使用Spark和Hadoop構建詐騙檢測系統
- 使用Java和MapReduce開發流失檢測系統
- 構建基於物聯網的數據收集和可視化系統
- 掌握為大型企業構建基於Hadoop的數據湖
- 了解Hadoop生態系統中NoSQL和內存數據庫的共存

詳細內容
如果您對Hadoop有基本的了解,並希望將您的知識應用於構建出色的大數據解決方案,那麼本書適合您。使用Hadoop生態系統中的工具構建六個現實生活中的端到端解決方案,將您對Hadoop的知識提升到更高的水平。

首先,了解可以使用Hadoop解決的各種業務問題。您還將熟悉用於構建基於Hadoop的解決方案的常見架構模式。通過處理不同類型的數據,建立客戶的360度視圖,並為金融機構構建高效的詐騙檢測系統。您還將開發一個在Hadoop中提高市場營銷活動效果的系統。使用Hadoop為電信公司構建一個流失檢測系統,開發一個用於監測工廠環境的物聯網系統,並使用本書中提到的概念和技術構建數據湖。

本書還涵蓋了其他技術和框架,如Apache Spark、Hive、Sqoop等,以及它們如何與Hadoop結合使用。您將能夠嘗試本書中解釋的解決方案,並將所學知識應用於自己的問題領域。

風格和方法
本書以實例驅動,每章涵蓋一個業務問題,並通過解釋數據集的結構和處理所需的工具來描述其解決方案。每個項目都以逐步的方式演示,並以非常易於理解的方式解釋。