The Definitive Guide to Apache Flink: Next Generation Data Processing
暫譯: Apache Flink 完全指南：下一代數據處理

Name: The Definitive Guide to Apache Flink: Next Generation Data Processing
Price: 1682 TWD
Availability: OnlineOnly
Author: Stefan Papp
ISBN: 1484214080

Stefan Papp

出版商: Apress
出版日期: 2016-06-08
售價: $1,770
貴賓價: 9.5 折 $1,682
語言: 英文
頁數: 400
裝訂: Paperback
ISBN: 1484214080
ISBN-13: 9781484214084

海外代購書籍(需單獨結帳)

商品描述

Data Processing is one of the core functionalities of distributed and cloud computing. There is a high demand on low latency and high performance computing as well as the support of abstract processing methods such as SQL querying, analytic frameworks or graph processing by data processing engines.

The Definitive Guide to Apache Flink by Papp starts with the history of Big Data processing with Hadoop and explains the shortcomings of Map Reduce. It shows how YARN and Hadoop 2.x changed the game and how new technologies started to compete to become the successor of Map Reduce.

After some detailed information on Tez and Spark and how they try to solve shortcomings of Map Reduce, this book deals with some architectural patterns for creating a solid data processing engine, such as advanced pipelining methods or in-memory caching. It shows how Flink is using these concepts.

Flink programming will be introduced in a hands-on approach. It starts with how to create a ten minutes build and how to run the first "Word Count" with Flink. Then it continues with more advanced topics such as programming more complex programs. All samples are programmed with Java or Scala.

It shows that Apache Flink has the potential to become one of the key technologies for distributed computing. It aims to replace many small technologies with a more powerful one that covers many aspects of Hadoop programming.