Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark
暫譯: 敏捷數據科學 2.0:使用 Spark 建立全棧數據分析應用程式
Russell Jurney
- 出版商: O'Reilly
- 出版日期: 2017-07-18
- 定價: $1,930
- 售價: 8.0 折 $1,544
- 語言: 英文
- 頁數: 352
- 裝訂: Paperback
- ISBN: 1491960116
- ISBN-13: 9781491960110
-
相關分類:
Spark、Agile Software、Data Science
-
相關翻譯:
Spark 全棧數據分析 (簡中版)
立即出貨 (庫存 < 4)
買這商品的人也買了...
相關主題
商品描述
Agile Data Science 2.0 covers the theory and practice of applying agile methods to the practice of applied analytics research called data science. The book takes the stance that data products are the preferred output format for data science teams to effect change in an organization. Accordingly, we show how to "get meta" to enable agility in building applications describing the applied research process itself. Then we show how to use big data tools to iteratively build, deploy and refine analytics applications. Tracking data-product development through the five stages of the "data value pyramid", we show you how to build applications from conception through development through deployment and then through iterative improvement. Application development is a fundamental skill for a data scientist, and by publishing your data science work as a web application, we show you how to effect maximal change within your organization.
Technologies covered include Python, Apache Spark (Spark MLlib, Spark Streaming), Apache Kafka, MongoDB, ElasticSearch, d3.js, scikit-learn and Apache Airflow. More important than any one technology, we show you how to compose a data platform to make you a productive application developer.
商品描述(中文翻譯)
《Agile Data Science 2.0》涵蓋了將敏捷方法應用於稱為數據科學的應用分析研究的理論與實踐。本書認為,數據產品是數據科學團隊在組織中實現變革的首選輸出格式。因此,我們展示了如何「獲得元」以促進在構建描述應用研究過程本身的應用程序時的敏捷性。接著,我們展示了如何使用大數據工具來迭代地構建、部署和完善分析應用程序。通過追蹤數據產品開發的五個階段,即「數據價值金字塔」,我們向您展示如何從構思到開發,再到部署,然後通過迭代改進來構建應用程序。應用程序開發是數據科學家的基本技能,通過將您的數據科學工作發布為網絡應用程序,我們向您展示如何在您的組織內實現最大的變革。
本書涵蓋的技術包括 Python、Apache Spark(Spark MLlib、Spark Streaming)、Apache Kafka、MongoDB、ElasticSearch、d3.js、scikit-learn 和 Apache Airflow。比起任何單一技術,我們更重要的是展示如何組合數據平台,使您成為一名高效的應用程序開發者。