Apache Accumulo for Developers
暫譯: Apache Accumulo 開發者指南

Guðmundur Jón Halldórsson

  • 出版商: Packt Publishing
  • 出版日期: 2013-10-13
  • 售價: $1,540
  • 貴賓價: 9.5$1,463
  • 語言: 英文
  • 頁數: 120
  • 裝訂: Paperback
  • ISBN: 1783285990
  • ISBN-13: 9781783285990
  • 海外代購書籍(需單獨結帳)

商品描述

Discover how to build Accumulo, Hadoop, and ZooKeeper clusters from scratch on both Windows and Linux. With this book's examples-based approach, you'll learn the painless way through clear instructions and real-world exercises.

Overview

  • Shows you how to build Accumulo, Hadoop, and ZooKeeper clusters from scratch on both Windows and Linux
  • Allows you to get hands-on knowledge about how to run Accumulo on Amazon EC2, Google Cloud Platform, Rackspace, and Windows Azure Cloud platforms
  • Packed with practical examples to enable you to manipulate Accumulo with ease

In Detail

Accumulo is a sorted and distributed key/value store designed to handle large amounts of data. Being highly robust and scalable, its performance makes it ideal for real-time data storage. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, Zookeeper, and Thrift.

Apache Accumulo for Developers is your guide to building an Accumulo cluster both as a single-node and multi-node, on-site and in the cloud. Accumulo has been proven to be able to handle petabytes of data, with cell-level security, and real-time analyses so this is your step by step guide in taking full advantage of this power.

Apache Accumulo for Developers looks at the process of setting up three systems - Hadoop, ZooKeeper, and Accumulo – and configuring, monitoring, and securing them.

You will learn to connect Accumulo to both Hadoop and ZooKeeper. You will also learn how to monitor the cluster (single-node or multi-node) to find any performance bottlenecks, and then integrate to Amazon EC2, Google Cloud Platform, Rackspace, and Windows Azure. When integrating with these cloud platforms, we will focus on scripting as well.

You will also learn to troubleshoot clusters with monitoring tools, and use Accumulo cell-level security to secure your data.

What you will learn from this book

  • Set up Hadoop, ZooKeeper, and Accumulo
  • Monitor clusters - both performance and application logs
  • Secure your data in Accumulo
  • Optimize Hadoop, ZooKeeper, and Accumulo performance
  • Integrate to various cloud platforms
  • Use the Accumulo command-line shell
  • Employ Ganglina to monitor the cluster and Graylog2 to monitor application logs
  • Understand what tools are needed to optimize Accumulo performance

Approach

The book will have a tutorial-based approach that will show the readers how to start from scratch with building an Accumulo cluster and learning how to monitor the system and implement aspects such as security.

Who this book is written for

This book is great for developers new to Accumulo, who are looking to get a good grounding in how to use Accumulo. It's assumed that you have an understanding of how Hadoop works, both HDFS and the Map/Reduce. No prior knowledge of ZooKeeper is assumed.

商品描述(中文翻譯)

發現如何在 Windows 和 Linux 上從零開始建立 Accumulo、Hadoop 和 ZooKeeper 叢集。這本書採用基於範例的方法,透過清晰的指示和實際的練習,讓您輕鬆學習。

概述
- 教您如何在 Windows 和 Linux 上從零開始建立 Accumulo、Hadoop 和 ZooKeeper 叢集
- 讓您獲得有關如何在 Amazon EC2、Google Cloud Platform、Rackspace 和 Windows Azure 雲平台上運行 Accumulo 的實作知識
- 充滿實用範例,使您能夠輕鬆操作 Accumulo

詳細內容
Accumulo 是一種排序和分佈的鍵/值存儲,旨在處理大量數據。由於其高度穩健和可擴展性,其性能使其非常適合實時數據存儲。Apache Accumulo 基於 Google 的 BigTable 設計,並建立在 Apache Hadoop、Zookeeper 和 Thrift 之上。

《Apache Accumulo for Developers》是您建立 Accumulo 叢集的指南,無論是單節點還是多節點,無論是在本地還是在雲端。Accumulo 已被證明能夠處理 PB 級別的數據,具備單元級安全性和實時分析,因此這是您充分利用這一強大功能的逐步指南。

《Apache Accumulo for Developers》探討了設置三個系統 - Hadoop、ZooKeeper 和 Accumulo 的過程,以及如何配置、監控和保護它們。

您將學會如何將 Accumulo 連接到 Hadoop 和 ZooKeeper。您還將學會如何監控叢集(單節點或多節點),以找出任何性能瓶頸,然後整合到 Amazon EC2、Google Cloud Platform、Rackspace 和 Windows Azure。在與這些雲平台整合時,我們也將重點放在腳本編寫上。

您還將學會使用監控工具來排除叢集故障,並使用 Accumulo 的單元級安全性來保護您的數據。

您將從這本書中學到的內容
- 設置 Hadoop、ZooKeeper 和 Accumulo
- 監控叢集 - 包括性能和應用程序日誌
- 在 Accumulo 中保護您的數據
- 優化 Hadoop、ZooKeeper 和 Accumulo 的性能
- 整合到各種雲平台
- 使用 Accumulo 命令行介面
- 使用 Ganglia 監控叢集,使用 Graylog2 監控應用程序日誌
- 了解優化 Accumulo 性能所需的工具

方法
本書將採用基於教程的方法,向讀者展示如何從零開始建立 Accumulo 叢集,並學習如何監控系統和實施安全等方面。

本書的讀者對象
這本書非常適合新接觸 Accumulo 的開發人員,幫助他們建立使用 Accumulo 的良好基礎。假設您對 Hadoop 的運作有一定了解,包括 HDFS 和 Map/Reduce。對於 ZooKeeper 不需要任何先前的知識。