Pro Hadoop Data Analytics: Designing and Building Big Data Systems using the Hadoop Ecosystem
暫譯: 專業 Hadoop 數據分析:設計與建構基於 Hadoop 生態系統的大數據系統
Kerry Koitzsch
- 出版商: Apress
- 出版日期: 2016-12-29
- 售價: $1,680
- 貴賓價: 9.5 折 $1,596
- 語言: 英文
- 頁數: 298
- 裝訂: Paperback
- ISBN: 1484219090
- ISBN-13: 9781484219096
-
相關分類:
Hadoop、大數據 Big-data、Data Science
已過版
買這商品的人也買了...
-
$1,808The Art of SEO: Mastering Search Engine Optimization, 3/e (Paperback)
-
$1,660$1,577 -
$990Spark in Action
-
$1,575Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS (paperback)
-
$2,800Algorithms for Data Science
-
$2,010$1,910 -
$798Deep Learning with Hadoop (Paperback)
-
$2,420$2,299 -
$1,500$1,425
商品描述
Learn advanced analytical techniques and leverage existing toolkits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems which go beyond the basics of classification, clustering, and recommendation.
In Pro Hadoop Data Analytics best practices are emphasized to ensure coherent, efficient development. A complete example system will be developed using standard third-party components which will consist of the toolkits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system.
The book emphasizes four important topics:
- The importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results.
- Best practices and structured design principles. This will include strategic topics as well as the how to example portions.
- The importance of mix-and-match or hybrid systems, using different analytical components in one application to accomplish application goals. The hybrid approach will be prominent in the examples.
- Use of existing third-party libraries is key to effective development. Deep dive examples of the functionality of some of these toolkits will be showcased as you develop the example system.
What You'll Learn
- The what, why, and how of building big data analytic systems with the Hadoop ecosystem
- Libraries, toolkits, and algorithms to make development easier and more effective
- Best practices to use when building analytic systems with Hadoop, and metrics to measure performance and efficiency of components and systems
- How to connect to standard relational databases, noSQL data sources, and more
- Useful case studies and example components which assist you in creating your own systems
商品描述(中文翻譯)
學習進階的分析技術並利用現有的工具包,使您的分析應用程式更強大、精確且高效。本書提供了架構、設計和實作資訊的正確組合,以創建超越分類、聚類和推薦基本概念的分析系統。
在Pro Hadoop Data Analytics中,強調最佳實踐以確保一致且高效的開發。將使用標準的第三方元件開發一個完整的範例系統,這些元件將包括工具包、函式庫、視覺化和報告代碼,以及支援的黏合程式,以提供一個可運作且可擴展的端到端系統。
本書強調四個重要主題:
- 端到端、靈活、可配置的高效能數據管道系統的重要性,這些系統具有分析元件以及適當的視覺化結果。
- 最佳實踐和結構化設計原則。這將包括策略性主題以及如何實作的範例部分。
- 混合或混搭系統的重要性,在一個應用程式中使用不同的分析元件以達成應用目標。混合方法將在範例中佔據重要地位。
- 使用現有的第三方函式庫是有效開發的關鍵。在開發範例系統的過程中,將深入探討這些工具包的一些功能範例。
您將學到什麼
- 使用Hadoop生態系統構建大數據分析系統的什麼、為什麼和如何
- 使開發更簡單和更有效的函式庫、工具包和演算法
- 在使用Hadoop構建分析系統時的最佳實踐,以及衡量元件和系統性能與效率的指標
- 如何連接到標準關聯數據庫、NoSQL數據源等
- 有用的案例研究和範例元件,幫助您創建自己的系統