The Enterprise Big Data Lake

Alex Gorelik

  • Publisher: O'Reilly
  • Publication date: 2019-04-16
  • List price: $2,600
  • Sale price: $2,470 (5% off)
  • Language: English
  • Pages: 200
  • Binding: Paperback
  • ISBN: 1491931558
  • ISBN-13: 9781491931554
  • Related categories: Hadoop, Big Data, Data Science

Product description

Enterprises are experimenting with using Hadoop to build Big Data Lakes, but many projects are stalling or failing because the approaches that worked at Internet companies have to be adapted for the enterprise. This practical handbook guides managers and IT professionals from the initial research and decision-making process through planning, choosing products, and implementing, maintaining, and governing the modern data lake.

You'll explore various approaches to starting and growing a Data Lake, including Data Warehouse off-loading, analytical sandboxes, and "Data Puddles." Author Alex Gorelik shows you methods for setting up different tiers of data, from raw untreated landing areas to carefully managed and summarized data. You'll learn how to enable self-service to help users find, understand, and provision data; how to provide different interfaces to users with different skill levels; and how to do all of that in compliance with enterprise data governance policies.
