Learning Apache Spark 2
Muhammad Asif Abbasi
- Publisher: Packt Publishing
- Publication date: 2017-03-24
- Price: $2,010
- VIP price: 5% off, $1,910
- Language: English
- Pages: 356
- Binding: Paperback
- ISBN: 1785885138
- ISBN-13: 9781785885136
- Related categories: Spark
Overseas special-order book (separate checkout required)
Product Description
Key Features
- An exclusive guide to getting up and running with fast data processing using Apache Spark
- Explore what Apache Spark can do through the real-world use cases in this book
- Want to perform efficient data processing in real time? This book will be your one-stop solution.
Book Description
The Spark juggernaut keeps rolling and gains momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML, and GraphX, all accessible via Java, Scala, Python, and R. Deploying these capabilities is crucial, whether on a standalone framework or as part of an existing Hadoop installation configured with YARN and Mesos.
The next part of the journey after installation is using the key components: the APIs, clustering, the machine learning APIs, data pipelines, and parallel programming. It is important to understand why each framework component is key, how widely it is used, how stable it is, and which use cases it suits.
Once we understand the individual components, we will work through a couple of real-life advanced analytics examples, such as ‘Building a Recommendation System’, ‘Predicting Customer Churn’, and so on.
The objective of these real-life examples is to give the reader confidence in using Spark for real-world problems.
What you will learn
- Get an overview of big data analytics and its importance for organizations and data professionals
- Delve into Spark to see how it differs from existing processing platforms
- Understand the intricacies of various file formats and how to process them with Apache Spark
- Learn how to deploy Spark with YARN and Mesos