HBase Administration Cookbook
Tentative Chinese title: HBase 管理實用手冊
Yifeng Jiang
- Publisher: Packt Publishing
- Publication date: 2012-08-17
- Price: $1,900
- Member price: 5% off, $1,805
- Language: English
- Pages: 332
- Binding: Paperback
- ISBN: 1849517142
- ISBN-13: 9781849517140
- Related categories: NoSQL
Product description
Master HBase configuration and administration for optimum database performance
- Move large amounts of data into HBase and learn how to manage it efficiently
- Set up HBase in the cloud, get it ready for production, and run it smoothly with high performance
- Get the most out of HBase alongside the rest of the Hadoop ecosystem, including HDFS, MapReduce, ZooKeeper, and Hive
In Detail
HBase is an open source, distributed big data store that scales to billions of rows and millions of columns and runs on clusters of commodity machines. If you are looking for a way to store and access huge amounts of data in real time, look no further than HBase.
HBase Administration Cookbook provides practical examples and simple step-by-step instructions so that you can administer HBase with ease. The recipes cover a wide range of processes for managing a fully distributed, highly available HBase cluster in the cloud. Working with such a huge amount of data means that an organized and manageable process is key, and this book will help you achieve that.
The recipes in this practical cookbook start with setting up a fully distributed HBase cluster and moving data into it. You will learn how to use the tools for day-to-day administration tasks, as well as how to manage and monitor the cluster efficiently to achieve the best possible performance. Understanding the relationship between Hadoop and HBase lets you get the best out of HBase, so the book also shows you how to set up Hadoop clusters, configure Hadoop to cooperate with HBase, and tune its performance.
What you will learn from this book
- Set up a fully distributed, highly available HBase cluster and load data into it using the standard client API or your own MapReduce job
- Access data in HBase via HBase Shell or Hive using its SQL-like query language
- Back up and restore HBase tables, along with their data distribution, and move or replicate data between different HBase clusters
- Gather metrics and show them in graphs, monitor the cluster's status, and get notified when thresholds are exceeded
- Tune kernel settings, JVM GC, and Hadoop and HBase configuration to maximize performance
- Discover troubleshooting tools and tips to avoid the most common problems with HBase
- Gain optimum performance through data compression, region splits, and manually managed compaction
- Learn advanced configuration and tuning for read-heavy and write-heavy clusters
Approach
As part of Packt's cookbook series, each recipe offers a practical, step-by-step solution to common problems found in HBase administration.
Who this book is written for
This book is for HBase administrators and developers, and it will also help Hadoop administrators. You are not required to have HBase experience, but you are expected to have a basic understanding of Hadoop and MapReduce.