Apache Spark 2.x Cookbook
暫譯: Apache Spark 2.x 食譜

Name: Apache Spark 2.x Cookbook
Price: 1320 TWD
Availability: InStock
Author: Rishi Yadav
ISBN: 1787127265

Rishi Yadav

預覽內頁

出版商: Packt Publishing
出版日期: 2017-05-31
定價: $1,650
售價: 8.0 折 $1,320
語言: 英文
頁數: 294
裝訂: Paperback
ISBN: 1787127265
ISBN-13: 9781787127265
相關分類: Spark

立即出貨 (庫存=1)

買這商品的人也買了...

~~$1,950~~ $1,853

Machine Learning With R Cookbook - 110 Recipes for Building Powerful Predictive Models with R (Paperback)
~~$1,780~~ $1,691

Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark (Paperback)
~~$2,420~~ $2,299

Python Deep Learning (Paperback)
~~$590~~ $460

TensorFlow + Keras 深度學習人工智慧實務應用
$2,144

Deep Learning: Practical Neural Networks with Java

商品描述

Key Features

Contains recipes on solving real-time data-processing problems with Apache Spark
Utilize core Spark modules such as Spark SQL, Spark MLlib, Spark Streaming, and GraphX processing
A practical guide to help you master Apache Spark as your single big data computing platform

Book Description

While Apache Spark 1.x gained lot of traction and adoption in the early years, Spark 2.0 delivers very notable improvements in the areas of API, Performance, Structured Streaming, and simplifying building blocks to build better, faster, smarter, and accessible big data applications. This book uncovers all these features in the form of structured recipes to analyze and mature large and complex sets of data.

Starting with installing and configuring Apache Spark with various cluster managers, you will learn to set up development environments. Furthermore, you will be introduced to working with RDD's, Data Frames to operate on data with schemas, and real-time streaming with various sources such as Twitter Stream and Apache Kafka. You will also work through recipes on machine learning, including supervised learning, unsupervised learning, recommendation engines, deep learning algorithms, and GPU implementations on Spark.

Last but not the least, the final few chapters will help you delve more deeply into the concepts of graph processing using GraphX, securing your implementations, cluster optimization, and troubleshooting.

What you will learn

Install and configure Apache Spark with various cluster managers
Set up a development environment for Apache Spark
Learn to operate on data in Spark with schemas
Get to grips with real-time streaming analytics using Spark Streaming
Master supervised learning and unsupervised learning using MLlib
Build a recommendation engine using MLlib
Use Tensorframes to manipulate Spark's DataFrames with TensorFlow programs for deep learning
Develop a set of common applications or project types, and solutions that solve complex big data problems

商品描述(中文翻譯)

**主要特點**

- 包含使用 Apache Spark 解決即時數據處理問題的範例
- 利用核心 Spark 模組，如 Spark SQL、Spark MLlib、Spark Streaming 和 GraphX 處理
- 實用指南，幫助您掌握 Apache Spark 作為單一的大數據計算平台

**書籍描述**

雖然 Apache Spark 1.x 在早期獲得了大量的關注和採用，但 Spark 2.0 在 API、性能、結構化流處理以及簡化構建模塊方面提供了顯著的改進，以便構建更好、更快、更智能且可訪問的大數據應用程式。本書以結構化的範例形式揭示了所有這些特性，以分析和成熟大型且複雜的數據集。

從安裝和配置 Apache Spark 及各種叢集管理器開始，您將學習如何設置開發環境。此外，您將接觸到使用 RDD 和 Data Frames 操作具有結構的數據，以及使用 Twitter Stream 和 Apache Kafka 等各種來源進行即時流處理。您還將學習機器學習的範例，包括監督式學習、非監督式學習、推薦引擎、深度學習算法以及在 Spark 上的 GPU 實現。

最後幾章將幫助您更深入地探討使用 GraphX 的圖形處理概念、保護您的實現、叢集優化和故障排除。

**您將學到的內容**

- 安裝和配置 Apache Spark 及各種叢集管理器
- 為 Apache Spark 設置開發環境
- 學習如何在 Spark 中操作具有結構的數據
- 熟悉使用 Spark Streaming 進行即時流分析
- 精通使用 MLlib 的監督式學習和非監督式學習
- 使用 MLlib 構建推薦引擎
- 使用 Tensorframes 操作 Spark 的 DataFrames，並結合 TensorFlow 程式進行深度學習
- 開發一組常見應用程式或專案類型，以及解決複雜大數據問題的解決方案