Practical Enterprise Data Lake Insights: Handle Data-Driven Challenges in an Enterprise Big Data Lake
暫譯: 實用企業數據湖洞察:應對企業大數據湖中的數據驅動挑戰
Saurabh Gupta, Venkata Giri
- 出版商: Apress
- 出版日期: 2018-06-28
- 定價: $1,650
- 售價: 6.0 折 $990
- 語言: 英文
- 頁數: 327
- 裝訂: Paperback
- ISBN: 1484235215
- ISBN-13: 9781484235218
-
相關分類:
大數據 Big-data
立即出貨 (庫存 < 3)
商品描述
Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.
When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data. Starting from sourcing data into the Hadoop ecosystem, you will go through stages that can bring up tough questions such as data processing, data querying, and security. Concepts such as change data capture and data streaming are covered. The book takes an end-to-end solution approach in a data lake environment that includes data security, high availability, data processing, data streaming, and more.
Each chapter includes application of a concept, code snippets, and use case demonstrations to provide you with a practical approach. You will learn the concept, scope, application, and starting point.
What You'll Learn
- Get to know data lake architecture and design principles
- Implement data capture and streaming strategies
- Implement data processing strategies in Hadoop
- Understand the data lake security framework and availability model
Who This Book Is For
Big data architects and solution architects
商品描述(中文翻譯)
使用這本實用指南,成功應對設計企業數據湖時遇到的挑戰,並學習行業最佳實踐以解決問題。
在設計企業數據湖時,您經常會遇到瓶頸,必須離開關聯數據的舒適區,學習處理非關聯數據的細微差別。從將數據引入Hadoop生態系統開始,您將經歷一些階段,這些階段可能會提出一些棘手的問題,例如數據處理、數據查詢和安全性。書中涵蓋了變更數據捕獲(change data capture)和數據流(data streaming)等概念。該書採取端到端的解決方案方法,涵蓋數據湖環境中的數據安全性、高可用性、數據處理、數據流等內容。
每一章都包括概念的應用、代碼片段和案例演示,以提供實用的方法。您將學習到概念、範疇、應用和起始點。
您將學到的內容:
- 了解數據湖架構和設計原則
- 實施數據捕獲和流式處理策略
- 在Hadoop中實施數據處理策略
- 理解數據湖安全框架和可用性模型
本書適合於:
大數據架構師和解決方案架構師