Spark: Big Data Cluster Computing in Production (Paperback)

Ilya Ganelin

  • Publisher: Wiley
  • Publication date: 2016-03-21
  • List price: $1,650
  • Sale price: $1,568 (5% off)
  • Language: English
  • Pages: 216
  • Binding: Paperback
  • ISBN: 1119254019
  • ISBN-13: 9781119254010
  • Related categories: Spark, Big Data
  • In stock, ships immediately (stock: 1)

Product Description

Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well known in the big data community, this book walks you through the challenges of moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, MLlib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, database connectors, streaming, security, and much more.

Spark has become the tool of choice for many Big Data problems, with more active contributors than any other Apache Software Foundation project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

* Review Spark hardware requirements and estimate cluster size
* Gain insight from real-world production use cases
* Tighten security, schedule resources, and fine-tune performance
* Overcome common problems encountered using Spark in production

Spark works with other big data tools, including MapReduce and Hadoop, and uses languages you already know, such as Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding its limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.
