Apache Accumulo for Developers

Guðmundur Jón Halldórsson

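  • Publisher: Packt Publishing
  • Publication date: 2013-10-13
  • List price: $1,510
  • VIP price: 5% off, $1,435
  • Language: English
  • Pages: 120
  • Binding: Paperback
  • ISBN: 1783285990
  • ISBN-13: 9781783285990
  • Overseas special-order title (requires separate checkout)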

Product description

Discover how to build Accumulo, Hadoop, and ZooKeeper clusters from scratch on both Windows and Linux. With this book's examples-based approach, you'll learn the painless way through clear instructions and real-world exercises.

Overview

  • Shows you how to build Accumulo, Hadoop, and ZooKeeper clusters from scratch on both Windows and Linux
  • Gives you hands-on knowledge of running Accumulo on the Amazon EC2, Google Cloud Platform, Rackspace, and Windows Azure cloud platforms
  • Packed with practical examples to enable you to manipulate Accumulo with ease

In Detail

Accumulo is a sorted, distributed key/value store designed to handle large amounts of data. It is highly robust and scalable, and its performance makes it well suited to real-time data storage. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift.
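
To make "sorted key/value store" concrete, here is a minimal in-memory sketch in Python. It is a toy model, not Accumulo's API: the `SortedKV` class, its `put` and `scan` methods, and the `(row, column)` key layout are all assumptions made for illustration. It only shows why keeping keys sorted makes range scans over adjacent rows cheap.

```python
import bisect


class SortedKV:
    """Toy sorted key/value store (hypothetical, not Accumulo's API)."""

    def __init__(self):
        self._keys = []    # (row, column) tuples, always kept sorted
        self._values = {}  # key -> value

    def put(self, key, value):
        # Insert new keys at their sorted position; overwrite existing ones.
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def scan(self, start_row, end_row):
        """Yield (key, value) pairs for rows in [start_row, end_row], in key order."""
        # (start_row,) sorts before every (start_row, column) tuple,
        # so bisect_left finds the first key belonging to start_row.
        lo = bisect.bisect_left(self._keys, (start_row,))
        for key in self._keys[lo:]:
            if key[0] > end_row:
                break
            yield key, self._values[key]


store = SortedKV()
store.put(("user3", "name"), "Carol")
store.put(("user1", "name"), "Alice")
store.put(("user2", "name"), "Bob")
store.put(("user1", "email"), "alice@example.com")

# Entries come back in key order regardless of insertion order.
result = list(store.scan("user1", "user2"))
```

Because the keys are sorted, the scan locates its starting point with a binary search and stops as soon as it passes `end_row`, rather than examining every entry.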

Apache Accumulo for Developers is your guide to building an Accumulo cluster, single-node or multi-node, on-site or in the cloud. Accumulo has been proven to handle petabytes of data with cell-level security and real-time analysis, and this book is a step-by-step guide to taking full advantage of that power.

Apache Accumulo for Developers looks at the process of setting up three systems (Hadoop, ZooKeeper, and Accumulo) and then configuring, monitoring, and securing them.

You will learn to connect Accumulo to both Hadoop and ZooKeeper. You will also learn how to monitor the cluster (single-node or multi-node) to find performance bottlenecks, and then how to integrate with Amazon EC2, Google Cloud Platform, Rackspace, and Windows Azure. The coverage of these cloud platforms also focuses on scripting.

You will also learn to troubleshoot clusters with monitoring tools, and use Accumulo cell-level security to secure your data.
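
The idea behind cell-level security can be sketched in a few lines of Python. This is a simplified illustration, not Accumulo's API: the `Cell` class, the `required` field, and the `visible_cells` helper are hypothetical names, and the sketch assumes each cell simply lists the labels a reader must hold, whereas real visibility rules can be richer boolean expressions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """One stored value plus the labels required to read it (hypothetical model)."""
    row: str
    column: str
    value: str
    required: frozenset  # labels a reader must hold; empty means public


def visible_cells(cells, authorizations):
    """Return only the cells whose required labels the reader holds."""
    auths = frozenset(authorizations)
    # A cell is visible when its required labels are a subset of the reader's.
    return [c for c in cells if c.required <= auths]


cells = [
    Cell("patient1", "name", "Alice", frozenset()),
    Cell("patient1", "diagnosis", "flu", frozenset({"medical"})),
    Cell("patient1", "billing", "overdue", frozenset({"finance", "admin"})),
]

# A reader holding only the "medical" label sees the public and medical cells.
nurse_view = [c.column for c in visible_cells(cells, {"medical"})]
```

The point of the sketch is that access is decided per cell rather than per table, so two readers scanning the same row can see different columns.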

What you will learn from this book

  • Set up Hadoop, ZooKeeper, and Accumulo
  • Monitor clusters, covering both performance and application logs
  • Secure your data in Accumulo
  • Optimize Hadoop, ZooKeeper, and Accumulo performance
  • Integrate with various cloud platforms
  • Use the Accumulo command-line shell
  • Employ Ganglia to monitor the cluster and Graylog2 to monitor application logs
  • Understand what tools are needed to optimize Accumulo performance

Approach

The book takes a tutorial-based approach, showing readers how to build an Accumulo cluster from scratch, monitor the system, and implement aspects such as security.

Who this book is written for

This book is great for developers new to Accumulo who want a good grounding in how to use it. It assumes that you understand how Hadoop works, both HDFS and MapReduce. No prior knowledge of ZooKeeper is assumed.
