Learning Apache Apex
暫譯: 學習 Apache Apex

Thomas Weise, Munagala V. Ramanath, David Yan, Kenneth Knowles

  • 出版商: Packt Publishing
  • 出版日期: 2017-11-30
  • 售價: $2,010
  • 貴賓價: 9.5$1,910
  • 語言: 英文
  • 頁數: 290
  • 裝訂: Paperback
  • ISBN: 1788296400
  • ISBN-13: 9781788296403
  • 海外代購書籍(需單獨結帳)

商品描述

Designing and writing a real-time streaming publication with Apache Apex About This Book * Get a clear, practical approach to real-time data processing * Program Apache Apex streaming applications * This book shows you Apex integration with the open source Big Data ecosystem Who This Book Is For This book assumes knowledge of application development with Java and familiarity with distributed systems. Familiarity with other real-time streaming frameworks is not required, but some practical experience with other big data processing utilities might be helpful. What You Will Learn * Put together a functioning Apex application from scratch * Scale an Apex application and configure it for optimal performance * Understand how to deal with failures via the fault tolerance features of the platform * Use Apex via other frameworks such as Beam * Understand the DevOps implications of deploying Apex In Detail Apache Apex is a next-generation, large-scale streaming framework designed to process data streams with minimum latency, maximum reliability, and strict correctness guarantees. Half of the book consists of Apex applications, showing you key aspects of data processing pipelines such as connectors for sources and sinks, and common data transformations. The other half of the book is evenly split into explaining the Apex framework, and tuning, testing, and scaling Apex applications. Much of our economic world depends on torrential streams of data, such as web browsing, social media feeds, financial records, data from mobile devices, and IoT (Internet of Things) sensor feeds. The projects in the book show how to process such streams to gain valuable, timely, and actionable insights. Traditional use cases, such as ETL, that currently consume a significant chunk of data engineering resources are also covered. The final chapter shows you future possibilities emerging in the streaming space, how Apache Apex is future-proofing the streaming space, and how Apache Apex is future-proofing the streaming infrastructure. Style and approach This book is divided into two major parts: first it explains what Apex is, what its relevant parts are, and how to write well-built Apex applications. The second part is entirely application-driven, walking you through Apex applications of increasing complexity.

商品描述(中文翻譯)

設計與撰寫使用 Apache Apex 的即時串流出版物

關於本書
* 獲得清晰、實用的即時數據處理方法
* 程式設計 Apache Apex 串流應用程式
* 本書展示了 Apex 與開源大數據生態系統的整合

本書適合對象
本書假設讀者具備 Java 應用程式開發的知識,並對分散式系統有一定的了解。對其他即時串流框架的熟悉程度並非必需,但對其他大數據處理工具的實務經驗可能會有所幫助。

您將學到的內容
* 從零開始組建一個可運行的 Apex 應用程式
* 擴展 Apex 應用程式並配置其以達到最佳性能
* 理解如何通過平台的容錯特性來處理故障
* 通過其他框架(如 Beam)使用 Apex
* 理解部署 Apex 的 DevOps 影響

詳細內容
Apache Apex 是一個下一代的大規模串流框架,旨在以最小延遲、最大可靠性和嚴格的正確性保證來處理數據流。本書的一半內容由 Apex 應用程式組成,展示了數據處理管道的關鍵方面,例如來源和匯出端的連接器,以及常見的數據轉換。書的另一半則均分為解釋 Apex 框架,以及調整、測試和擴展 Apex 應用程式。當今的經濟世界在很大程度上依賴於大量的數據流,例如網頁瀏覽、社交媒體動態、財務記錄、來自移動設備的數據以及物聯網 (IoT) 感測器數據。書中的專案展示了如何處理這些數據流,以獲得有價值、及時且可行的見解。傳統的使用案例,例如 ETL,目前消耗了大量數據工程資源的情況也有涵蓋。最後一章展示了串流領域中出現的未來可能性,Apache Apex 如何為串流領域未來做好準備,以及 Apache Apex 如何為串流基礎設施未來做好準備。

風格與方法
本書分為兩個主要部分:首先解釋 Apex 是什麼、其相關部分是什麼,以及如何撰寫良好的 Apex 應用程式。第二部分則完全以應用為驅動,帶領您逐步了解日益複雜的 Apex 應用程式。