Spark: The Definitive Guide: Big Data Processing Made Simple (Paperback)
暫譯: Spark:權威指南:簡化大數據處理
Bill Chambers, Matei Zaharia
- 出版商: O'Reilly
- 出版日期: 2018-04-03
- 定價: $2,450
- 售價: 8.0 折 $1,960
- 語言: 英文
- 頁數: 606
- 裝訂: Paperback
- ISBN: 1491912219
- ISBN-13: 9781491912218
-
相關分類:
Spark、大數據 Big-data
-
相關翻譯:
Spark 技術手冊|輕鬆寫意處理大數據 (Spark: The Definitive Guide|Big Data Processing Made Simple) (繁中版)
立即出貨 (庫存 < 3)
買這商品的人也買了...
-
$653C++ Primer, 5/e (簡體中文版)
-
$788Functional Programming in Scala (Paperback)
-
$1,570$1,492 -
$1,820Hadoop: The Definitive Guide, 4/e (Paperback)
-
$780$616 -
$420$357 -
$560$476 -
$580$458 -
$1,364PostgreSQL: Up and Running: A Practical Guide to the Advanced Open Source Database, 3/e (Paperback)
-
$924Practical Monitoring: Effective Strategies for the Real World
-
$780$663 -
$1,584Flask Web Development : Developing Web Applications with Python, 2/e (Paperback)
-
$690$538 -
$450$351 -
$311Clojure 編程實戰, 2/e (Clojure in Action, 2/e)
-
$1,584Programming TypeScript: Making Your JavaScript Applications Scale
-
$550$363 -
$888$844 -
$580$458 -
$780$663 -
$1,824Learning React: Modern Patterns for Developing React Apps, 2/e
-
$2,800$2,660 -
$580$493 -
$680$537 -
$520$411
相關主題
商品描述
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of this open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals.
You’ll explore the basic operations and common functions of Spark’s structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications. Developers and system administrators will learn the fundamentals of monitoring, tuning, and debugging Spark, and explore machine learning techniques and scenarios for employing MLlib, Spark’s scalable machine learning library.
- Get a gentle overview of big data and Spark
- Learn about DataFrames, SQL, and Datasets—Spark’s core APIs—through worked examples
- Dive into Spark’s low-level APIs, RDDs, and execution of SQL and DataFrames
- Understand how Spark runs on a cluster
- Debug, monitor, and tune Spark clusters and applications
- Learn the power of Spark’s Structured Streaming and MLlib for machine learning tasks
- Explore the wider Spark ecosystem, including SparkR and Graph Analysis
- Examine Spark deployment, including coverage of Spark in the Cloud
商品描述(中文翻譯)
學習如何使用、部署和維護 Apache Spark,這本由該開源集群計算框架的創建者撰寫的綜合指南。作者 Bill Chambers 和 Matei Zaharia 強調 Spark 2.0 的改進和新功能,將 Spark 主題分解為不同的部分,每個部分都有獨特的目標。
您將探索 Spark 結構化 API 的基本操作和常見功能,以及結構化流(Structured Streaming),這是一個用於構建端到端流應用程序的新高級 API。開發人員和系統管理員將學習監控、調整和調試 Spark 的基本原則,並探索使用 MLlib(Spark 的可擴展機器學習庫)的機器學習技術和場景。
- 獲得大數據和 Spark 的簡要概述
- 通過實作範例了解 DataFrames、SQL 和 Datasets——Spark 的核心 API
- 深入了解 Spark 的低級 API、RDD 和 SQL 及 DataFrames 的執行
- 理解 Spark 如何在集群上運行
- 調試、監控和調整 Spark 集群和應用程序
- 學習 Spark 的結構化流和 MLlib 在機器學習任務中的強大功能
- 探索更廣泛的 Spark 生態系統,包括 SparkR 和圖形分析
- 檢查 Spark 的部署,包括雲端中的 Spark 覆蓋範圍