Hadoop Blueprints
暫譯: Hadoop 藍圖

Anurag Shrivastava, Tanmay Deshpande

  • 出版商: Packt Publishing
  • 出版日期: 2016-09-30
  • 售價: $2,000
  • 貴賓價: 9.5$1,900
  • 語言: 英文
  • 頁數: 316
  • 裝訂: Paperback
  • ISBN: 1783980303
  • ISBN-13: 9781783980307
  • 相關分類: Hadoop
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

Use Hadoop to solve business problems by learning from a rich set of real-life case studies

About This Book

  • Solve real-world business problems using Hadoop and other Big Data technologies
  • Build efficient data lakes in Hadoop, and develop systems for various business cases like improving marketing campaigns, fraud detection, and more
  • Power packed with six case studies to get you going with Hadoop for Business Intelligence

Who This Book Is For

If you are interested in building efficient business solutions using Hadoop, this is the book for you This book assumes that you have basic knowledge of Hadoop, Java, and any scripting language.

What You Will Learn

  • Learn about the evolution of Hadoop as the big data platform
  • Understand the basics of Hadoop architecture
  • Build a 360 degree view of your customer using Sqoop and Hive
  • Build and run classification models on Hadoop using BigML
  • Use Spark and Hadoop to build a fraud detection system
  • Develop a churn detection system using Java and MapReduce
  • Build an IoT-based data collection and visualization system
  • Get to grips with building a Hadoop-based Data Lake for large enterprises
  • Learn about the coexistence of NoSQL and In-Memory databases in the Hadoop ecosystem

In Detail

If you have a basic understanding of Hadoop and want to put your knowledge to use to build fantastic Big Data solutions for business, then this book is for you. Build six real-life, end-to-end solutions using the tools in the Hadoop ecosystem, and take your knowledge of Hadoop to the next level.

Start off by understanding various business problems which can be solved using Hadoop. You will also get acquainted with the common architectural patterns which are used to build Hadoop-based solutions. Build a 360-degree view of the customer by working with different types of data, and build an efficient fraud detection system for a financial institution. You will also develop a system in Hadoop to improve the effectiveness of marketing campaigns. Build a churn detection system for a telecom company, develop an Internet of Things (IoT) system to monitor the environment in a factory, and build a data lake all making use of the concepts and techniques mentioned in this book.

The book covers other technologies and frameworks like Apache Spark, Hive, Sqoop, and more, and how they can be used in conjunction with Hadoop. You will be able to try out the solutions explained in the book and use the knowledge gained to extend them further in your own problem space.

Style and approach

This is an example-driven book where each chapter covers a single business problem and describes its solution by explaining the structure of a dataset and tools required to process it. Every project is demonstrated with a step-by-step approach, and explained in a very easy-to-understand manner.

商品描述(中文翻譯)

使用 Hadoop 解決商業問題,透過豐富的實際案例學習

關於本書
- 使用 Hadoop 和其他大數據技術解決現實世界的商業問題
- 在 Hadoop 中建立高效的數據湖,並為各種商業案例開發系統,例如改善行銷活動、詐騙檢測等
- 包含六個案例研究,幫助您開始使用 Hadoop 進行商業智慧

本書適合誰
如果您有興趣使用 Hadoop 建立高效的商業解決方案,那麼這本書就是為您而寫。本書假設您對 Hadoop、Java 和任何腳本語言有基本的了解。

您將學到什麼
- 了解 Hadoop 作為大數據平台的演變
- 理解 Hadoop 架構的基本概念
- 使用 Sqoop 和 Hive 建立客戶的 360 度視圖
- 使用 BigML 在 Hadoop 上建立和運行分類模型
- 使用 Spark 和 Hadoop 建立詐騙檢測系統
- 使用 Java 和 MapReduce 開發流失檢測系統
- 建立基於物聯網 (IoT) 的數據收集和可視化系統
- 掌握為大型企業建立基於 Hadoop 的數據湖
- 了解 NoSQL 和內存數據庫在 Hadoop 生態系統中的共存

詳細內容
如果您對 Hadoop 有基本的了解,並希望將您的知識應用於建立出色的大數據商業解決方案,那麼這本書就是為您而寫。使用 Hadoop 生態系統中的工具建立六個實際的端到端解決方案,並將您的 Hadoop 知識提升到新的水平。

首先了解可以使用 Hadoop 解決的各種商業問題。您還將熟悉用於構建基於 Hadoop 的解決方案的常見架構模式。通過處理不同類型的數據,建立客戶的 360 度視圖,並為金融機構建立高效的詐騙檢測系統。您還將在 Hadoop 中開發一個系統,以提高行銷活動的有效性。為電信公司建立流失檢測系統,開發一個物聯網 (IoT) 系統以監控工廠環境,並建立一個數據湖,所有這些都利用本書中提到的概念和技術。

本書還涵蓋其他技術和框架,如 Apache Spark、Hive、Sqoop 等,以及它們如何與 Hadoop 結合使用。您將能夠嘗試書中解釋的解決方案,並利用所獲得的知識在自己的問題空間中進一步擴展。

風格與方法
這是一本以範例為驅動的書籍,每一章都涵蓋一個商業問題,並通過解釋數據集的結構和處理所需的工具來描述其解決方案。每個項目都以逐步的方法演示,並以非常易於理解的方式進行解釋。