Practical Apache Spark: Using the Scala API
暫譯: 實用 Apache Spark:使用 Scala API

Subhashini Chellappan, Dharanitharan Ganesan

  • 出版商: Apress
  • 出版日期: 2018-12-13
  • 定價: $2,100
  • 售價: 8.0$1,680
  • 語言: 英文
  • 頁數: 280
  • 裝訂: Paperback
  • ISBN: 1484236513
  • ISBN-13: 9781484236512
  • 相關分類: JVM 語言Spark
  • 立即出貨(限量) (庫存=2)

  • Practical Apache Spark: Using the Scala API-preview-1
  • Practical Apache Spark: Using the Scala API-preview-2
  • Practical Apache Spark: Using the Scala API-preview-3
  • Practical Apache Spark: Using the Scala API-preview-4
  • Practical Apache Spark: Using the Scala API-preview-5
  • Practical Apache Spark: Using the Scala API-preview-6
  • Practical Apache Spark: Using the Scala API-preview-7
  • Practical Apache Spark: Using the Scala API-preview-8
Practical Apache Spark: Using the Scala API-preview-1

相關主題

商品描述

Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You’ll follow a learn-to-do-by-yourself approach to learning – learn the concepts, practice the code snippets in Scala, and complete the assignments given to get an overall exposure. 

On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.

 
What You Will Learn
  • Discover the functional programming features of Scala
  • Understand the complete architecture of Spark and its components
  • Integrate Apache Spark with Hive and Kafka 
  • Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
  • Work with different machine learning concepts and libraries using Spark's MLlib packages
 
Who This Book Is For
 
Developers and professionals who deal with batch and stream data processing. 

 

商品描述(中文翻譯)

使用 Scala 與 Apache Spark 進行單節點、多節點及高可用性叢集的部署與設置。本書討論了 Spark 的各種組件,如 Spark Core、DataFrames、Datasets 和 SQL、Spark Streaming、Spark MLib,以及在 Spark 上使用 R,並針對每個主題提供實用的程式碼片段。《實用 Apache Spark》還涵蓋了 Apache Spark 與 Kafka 的整合,並提供範例。您將採用自學的方式進行學習——了解概念、在 Scala 中練習程式碼片段,並完成給定的作業,以獲得全面的曝光。

完成後,您將掌握 Scala 的函數式程式設計方面的知識,並在各種 Spark 組件上獲得實作經驗。您還將熟悉機器學習演算法的實時應用。

您將學到的內容:

- 探索 Scala 的函數式程式設計特性
- 理解 Spark 的完整架構及其組件
- 將 Apache Spark 與 Hive 和 Kafka 整合
- 使用 Spark SQL、DataFrames 和 Datasets 透過傳統 SQL 查詢處理數據
- 使用 Spark 的 MLlib 套件處理不同的機器學習概念和庫

本書適合對象:

處理批次和串流數據處理的開發人員和專業人士。