Data Lake for Enterprises
暫譯: 企業數據湖

Tomcy John, Pankaj Misra

  • 出版商: Packt Publishing
  • 出版日期: 2017-05-31
  • 售價: $1,600
  • 貴賓價: 9.5$1,520
  • 語言: 英文
  • 頁數: 596
  • 裝訂: Paperback
  • ISBN: 1787281345
  • ISBN-13: 9781787281349
  • 海外代購書籍(需單獨結帳)

商品描述

About This Book

  • Build a full-fledged data lake for your organization with popular big data technologies using the Lambda architecture as the base
  • Delve into the big data technologies required to meet modern day business strategies
  • A highly practical guide to implementing enterprise data lakes with lots of examples and real-world use-cases

Who This Book Is For

Java developers and architects who would like to implement a data lake for their enterprise will find this book useful. If you want to get hands-on experience with the Lambda Architecture and big data technologies by implementing a practical solution using these technologies, this book will also help you.

What You Will Learn

  • Build an enterprise-level data lake using the relevant big data technologies
  • Understand the core of the Lambda architecture and how to apply it in an enterprise
  • Learn the technical details around Sqoop and its functionalities
  • Integrate Kafka with Hadoop components to acquire enterprise data
  • Use flume with streaming technologies for stream-based processing
  • Understand stream- based processing with reference to Apache Spark Streaming
  • Incorporate Hadoop components and know the advantages they provide for enterprise data lakes
  • Build fast, streaming, and high-performance applications using ElasticSearch
  • Make your data ingestion process consistent across various data formats with configurability
  • Process your data to derive intelligence using machine learning

商品描述(中文翻譯)

關於本書

- 使用 Lambda 架構作為基礎,為您的組織建立一個完整的資料湖,並運用流行的大數據技術
- 深入探討滿足現代商業策略所需的大數據技術
- 一本高度實用的指南,提供許多範例和實際案例,幫助實施企業資料湖

本書適合誰

本書對於希望為其企業實施資料湖的 Java 開發人員和架構師將會非常有用。如果您想透過實施實際解決方案來獲得 Lambda 架構和大數據技術的實作經驗,本書也將對您有所幫助。

您將學到什麼

- 使用相關的大數據技術建立企業級資料湖
- 理解 Lambda 架構的核心及其在企業中的應用
- 學習有關 Sqoop 及其功能的技術細節
- 將 Kafka 與 Hadoop 組件整合以獲取企業數據
- 使用 Flume 與串流技術進行基於串流的處理
- 參考 Apache Spark Streaming 理解基於串流的處理
- 整合 Hadoop 組件並了解它們為企業資料湖提供的優勢
- 使用 ElasticSearch 建立快速、串流和高效能的應用程式
- 使您的數據攝取過程在各種數據格式中保持一致,並具備可配置性
- 使用機器學習處理您的數據以獲取智慧

最後瀏覽商品 (20)