Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka
暫譯: 大數據SMACK:Apache Spark、Mesos、Akka、Cassandra與Kafka指南

Raul Estrada, Isaac Ruiz

買這商品的人也買了...

商品描述

This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large datasets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.

Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer:

  • The language: Scala
  • The engine: Spark (SQL, MLib, Streaming, GraphX)
  • The container: Mesos, Docker
  • The view: Akka
  • The storage: Cassandra
  • The message broker: Kafka

What you’ll learn

  • How to make big data architecture without using complex Greek letter architectures.
  • How to build a cheap but effective cluster infrastructure.
  • How to make queries, reports, and graphs that business demands.
  • How to manage and exploit unstructured and No-SQL data sources.
  • How use tools to monitor the performance of your architecture.
  • How to integrate all technologies and decide which replace and which reinforce.

Who This Book Is For

This book is for developers, data architects, and data scientists looking for how to integrate the most successful big data open stack architecture and how to choose the correct technology in every layer.

商品描述(中文翻譯)

這本書探討如何整合全棧開源大數據架構,以及如何在每一層選擇正確的技術——Scala/Spark、Mesos、Akka、Cassandra 和 Kafka。大數據架構正成為許多不同企業的需求。然而,到目前為止,重點主要集中在及時收集、聚合和處理大型數據集。在許多情況下,組織需要不止一種範式來進行有效的分析。

《Big Data SMACK》解釋了每一種全棧技術,更重要的是,如何最佳地整合它們。它詳細介紹了這些技術的實際好處,並在每種情況中融入了現實世界的例子。這本書專注於架構所解決的問題和場景,以及每種技術提供的解決方案。它涵蓋了大數據架構的六個主要概念,以及如何整合、替換和加強每一層:

- 語言:Scala
- 引擎:Spark(SQL、MLib、Streaming、GraphX)
- 容器:Mesos、Docker
- 視圖:Akka
- 存儲:Cassandra
- 消息代理:Kafka

**你將學到什麼**

- 如何在不使用複雜的希臘字母架構的情況下構建大數據架構。
- 如何建立一個便宜但有效的集群基礎設施。
- 如何製作業務所需的查詢、報告和圖表。
- 如何管理和利用非結構化和 No-SQL 數據源。
- 如何使用工具來監控架構的性能。
- 如何整合所有技術,並決定哪些需要替換,哪些需要加強。

**本書適合誰**

這本書適合尋找如何整合最成功的大數據開源堆棧架構以及如何在每一層選擇正確技術的開發人員、數據架構師和數據科學家。

最後瀏覽商品 (20)