Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs
暫譯: Apache Spark 2.x Java 開發者指南:使用 Apache Spark 2.x Java API 探索大規模大數據

Sourav Gulati, Sumit Kumar

  • 出版商: Packt Publishing
  • 出版日期: 2017-07-27
  • 售價: $2,210
  • 貴賓價: 9.5$2,100
  • 語言: 英文
  • 頁數: 350
  • 裝訂: Paperback
  • ISBN: 1787126498
  • ISBN-13: 9781787126497
  • 相關分類: Java 程式語言Spark
  • 海外代購書籍(需單獨結帳)

  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-1
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-2
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-3
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-4
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-5
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-6
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-7
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-8
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-9
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-10
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-11
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-12
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-13
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-14
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-15
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-16
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-17
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-18
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-19
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-20
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-21
  • Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-22
Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs-preview-1

買這商品的人也買了...

商品描述

Key Features

  • Perform Big Data processing with Spark-without having to learn Scala!
  • Use the Spark Java API to implement efficient enterprise-grade applications for data processing and analytics
  • Go beyond the mainstream data processing by adding querying capability, machine learning, and graph processing using Spark

Book Description

Apache Spark is the buzzword in the Big Data industry right now, especially with the increasing need for real-time streaming and data processing. While Spark is built on Scala, the Spark Java API exposes all the Spark features available in the Scala version for Java developers. This book will show you how you can implement various functionalities of the Apache Spark framework in Java, without stepping out of your comfort zone.

The book starts with introduction to the Apache Spark ecosystem, followed by explaining the Spark installation and configuration, and refreshes the Java concepts that will be useful to you when consuming Apache Spark's APIs. You will explore RDD and its associated common Action and Transformation Java APIs, set up a production-like clustered environment, and work with Spark SQL. Moving on, you will perform near real-time processing with Spark streaming, machine learning analytics with Spark MLlib, and graph processing with GraphX using the various Java packages.

By the end of the book, you will have a solid foundation in implementing the components in the Spark framework in Java to build fast, real-time applications

What you will learn

  • Process data using different file formats such as XML, JSON, CSV, and plain and delimited text using Spark core Library
  • Perform analytics on data from various data sources such as Kafka, Flume, and Twitter using Spark Streaming Library
  • Learn SQL schema creation and analysis of structured data using various SQL functions including Windowing functions of Spark SQL Library
  • Explore the Spark Mlib APIs while implementing machine learning techniques to solve real-world problems
  • Get to know Spark GraphX so you understand various Graph-based analytics that can be performed with Spark

商品描述(中文翻譯)

主要特點

- 使用 Spark 進行大數據處理—無需學習 Scala!
- 使用 Spark Java API 實現高效的企業級數據處理和分析應用程式
- 通過使用 Spark 增加查詢能力、機器學習和圖形處理,超越主流數據處理

書籍描述

Apache Spark 是當前大數據行業的熱門詞彙,特別是在對實時流處理和數據處理需求日益增加的情況下。雖然 Spark 是基於 Scala 開發的,但 Spark Java API 將 Scala 版本中所有可用的 Spark 功能暴露給 Java 開發者。本書將向您展示如何在 Java 中實現 Apache Spark 框架的各種功能,而無需走出您的舒適區。

本書首先介紹 Apache Spark 生態系統,接著解釋 Spark 的安裝和配置,並刷新您在使用 Apache Spark 的 API 時所需的 Java 概念。您將探索 RDD 及其相關的常見 Action 和 Transformation Java API,設置類似生產環境的集群環境,並使用 Spark SQL 進行操作。接下來,您將使用 Spark Streaming 進行近實時處理,使用 Spark MLlib 進行機器學習分析,並使用各種 Java 套件進行 GraphX 的圖形處理。

到本書結束時,您將在 Java 中實現 Spark 框架組件的基礎上,建立快速、實時的應用程式。

您將學到的內容

- 使用 Spark 核心庫處理不同文件格式的數據,如 XML、JSON、CSV 以及純文本和分隔文本
- 使用 Spark Streaming 庫對來自各種數據源(如 Kafka、Flume 和 Twitter)的數據進行分析
- 學習 SQL 架構的創建和結構化數據的分析,使用包括 Spark SQL 庫的窗口函數在內的各種 SQL 函數
- 在實現機器學習技術以解決現實世界問題的同時,探索 Spark Mlib API
- 了解 Spark GraphX,以便理解可以使用 Spark 執行的各種基於圖形的分析