Apache Sqoop Cookbook (Paperback)
暫譯: Apache Sqoop 食譜 (平裝本)
Kathleen Ting, Jarek Jarcec Cecho
- 出版商: O'Reilly
- 出版日期: 2013-08-20
- 定價: $495
- 售價: 9.5 折 $470
- 語言: 英文
- 頁數: 94
- 裝訂: Paperback
- ISBN: 1449364624
- ISBN-13: 9781449364625
-
相關分類:
Hadoop、大數據 Big-data
立即出貨 (庫存 < 3)
買這商品的人也買了...
-
$249$197 -
$235Hadoop 雲計算實戰
-
$880$748 -
$500$390 -
$580$458 -
$880$695 -
$600$510 -
$580$458 -
$352Hadoop 技術內幕-深入解析 MapReduce 架構設計與實現原理
-
$454Hadoop 技術內幕-深入解析 Hadoop Common 和 HDFS 架構設計與實現原理
-
$1,260Data Warehousing in the Age of Big Data (Paperback)
-
$301HBase 管理指南 (HBase Administration Cookbook)
-
$480$432 -
$680$537 -
$360$284 -
$380$296 -
$360$284 -
$680$530 -
$560$442 -
$500$395 -
$350$277 -
$560$442 -
$490$387 -
$580$458 -
$720$562
商品描述
Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop.
Sqoop is both powerful and bewildering, but with this cookbook’s problem-solution-discussion format, you’ll quickly learn how to deploy and then apply Sqoop in your environment. The authors provide MySQL, Oracle, and PostgreSQL database examples on GitHub that you can easily adapt for SQL Server, Netezza, Teradata, or other relational systems.
- Transfer data from a single database table into your Hadoop ecosystem
- Keep table data and Hadoop in sync by importing data incrementally
- Import data from more than one database table
- Customize transferred data by calling various database functions
- Export generated, processed, or backed-up data from Hadoop to your database
- Run Sqoop within Oozie, Hadoop’s specialized workflow scheduler
- Load data into Hadoop’s data warehouse (Hive) or database (HBase)
- Handle installation, connection, and syntax issues common to specific database vendors
商品描述(中文翻譯)
整合來自多個來源的數據在大數據時代是至關重要的,但這可能是一項具有挑戰性且耗時的任務。本書提供了數十個現成的食譜,用於使用 Apache Sqoop,這是一個命令行介面應用程式,旨在優化關聯數據庫與 Hadoop 之間的數據傳輸。
Sqoop 既強大又令人困惑,但通過本書的問題-解決-討論格式,您將迅速學會如何在您的環境中部署並應用 Sqoop。作者在 GitHub 上提供了 MySQL、Oracle 和 PostgreSQL 數據庫的範例,您可以輕鬆地將其調整為 SQL Server、Netezza、Teradata 或其他關聯系統。
- 將單個數據庫表中的數據傳輸到您的 Hadoop 生態系統
- 通過增量導入數據來保持表數據與 Hadoop 的同步
- 從多個數據庫表中導入數據
- 通過調用各種數據庫函數自定義傳輸的數據
- 將生成的、處理過的或備份的數據從 Hadoop 導出到您的數據庫
- 在 Oozie 中運行 Sqoop,Oozie 是 Hadoop 的專用工作流調度器
- 將數據加載到 Hadoop 的數據倉庫(Hive)或數據庫(HBase)
- 處理特定數據庫供應商常見的安裝、連接和語法問題