Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark (Paperback)
Zubair Nabi
- Publisher: Apress
- Publication date: 2016-06-14
- List price: $1,760
- VIP price: 5% off, $1,672
- Language: English
- Pages: 230
- Binding: Paperback
- ISBN: 1484214803
- ISBN-13: 9781484214800
- Related category: Spark
- Related translation: Spark 實時大數據分析 : 基於 Spark Streaming 框架 (Simplified Chinese edition)
Product description
Learn the right cutting-edge skills and knowledge to leverage Spark Streaming to implement a wide array of real-time, streaming applications. Pro Spark Streaming walks you through end-to-end real-time application development using real-world applications, data, and code. Taking an application-first approach, each chapter introduces use cases from a specific industry and uses publicly available datasets from that domain to unravel the intricacies of production-grade design and implementation. The domains covered in the book include social media, the sharing economy, finance, online advertising, telecommunication, and IoT.
In the last few years, Spark has become synonymous with big data processing. DStreams enhance the underlying Spark processing engine to support streaming analysis with a novel micro-batch processing model. Pro Spark Streaming by Zubair Nabi will enable you to become a specialist in latency-sensitive applications by leveraging the key features of DStreams, micro-batch processing, and functional programming. To this end, the book includes ready-to-deploy examples and actual code. Pro Spark Streaming will act as the bible of Spark Streaming.
What You'll Learn:
- Spark Streaming application development and best practices
- Low-level details of discretized streams
- The application and vitality of streaming analytics to a number of industries and domains
- Optimization of production-grade deployments of Spark Streaming via configuration recipes and instrumentation using Graphite, collectd, and Nagios
- Ingestion of data from disparate sources including MQTT, Flume, Kafka, Twitter, and a custom HTTP receiver
- Integration and coupling with HBase, Cassandra, and Redis
- Design patterns for side-effects and maintaining state across the Spark Streaming micro-batch model
- Real-time and scalable ETL using data frames, SparkSQL, Hive, and SparkR
- Streaming machine learning, predictive analytics, and recommendations
- Meshing batch processing with stream processing via the Lambda architecture
The audience includes data scientists, big data experts, BI analysts, and data architects.