Instant Pentaho Data Integration Kitchen

Sergio Ramazzina

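  • Publisher: Packt Publishing
  • Publication date: 2013-07-28
  • List price: $1,360
  • Member price: $1,292 (5% off)
  • Language: English
  • Pages: 68
  • Binding: Paperback
  • ISBN: 184969690X
  • ISBN-13: 9781849696906
  • Overseas special-order title (requires separate checkout)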

Product Description

Explore the world of Pentaho Data Integration command-line tools, which will help you use Kitchen.

Overview

  • Learn something new in an Instant! A short, fast, focused guide delivering immediate results
  • Understand how to discover the repository structure using the command-line scripts
  • Learn how to configure logging properly and gather the information that helps you investigate any kind of problem
  • Explore all the possible ways to start jobs and transformations without any difficulty

In Detail

Pentaho Data Integration (PDI) is a modern, powerful, and easy-to-use ETL system that lets you develop ETL processes with simplicity. Through extensive descriptions and a good set of samples, this book gives you the experience and skills you need to run processes from the command line or schedule them.

Instant Pentaho Data Integration Kitchen How-to will help you understand the correct way to work with the PDI command-line tools.

We start with a recap of how transformations and jobs are designed using Spoon, then configure memory requirements so that your processes run properly from the command line, and then move on to a set of recipes that show you the different ways to start PDI processes.

We dive into the various flags that control the logging system, covering both the logging output and the log verbosity. The focus throughout is on delivering the knowledge you need to run ETL processes with the command-line tools easily and proficiently.
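As an illustration of the logging flags mentioned above, here is a minimal Python sketch that assembles a Kitchen command line. It is not taken from the book. It assumes the Linux-style `-file`, `-level`, and `-logfile` options and the log-level names described in the Pentaho documentation for Kitchen; the helper name `kitchen_command`, the default launcher path `./kitchen.sh`, and the example file paths are all hypothetical.

```python
import shlex

# Log level names as described in the Pentaho documentation for Kitchen
# (assumed here; check them against your PDI version).
LOG_LEVELS = {"Nothing", "Error", "Minimal", "Basic", "Detailed", "Debug", "Rowlevel"}


def kitchen_command(job_file, level="Basic", logfile=None, kitchen="./kitchen.sh"):
    """Build the argument list for running a .kjb job file with Kitchen.

    `kitchen` is the path to the launcher script (hypothetical default).
    """
    if level not in LOG_LEVELS:
        raise ValueError(f"unknown log level: {level}")
    args = [kitchen, f"-file={job_file}", f"-level={level}"]
    if logfile is not None:
        # Send the log to a file in addition to choosing its verbosity.
        args.append(f"-logfile={logfile}")
    return args


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = kitchen_command(
        "/opt/etl/load_sales.kjb",
        level="Detailed",
        logfile="/var/log/etl/load_sales.log",
    )
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps it safe to pass directly to `subprocess.run` without shell quoting issues.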

What you will learn from this book

  • Understand how to configure memory requirements
  • Discover the PDI repository structure from the command line
  • Explore how to start jobs from the filesystem, including jobs packed in an archive file
  • Schedule PDI processes on Linux and Windows
  • Master the art of configuring log levels and logging output
  • Start jobs from the repository
  • Get feedback from your process execution
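One way to get feedback from a process execution is to inspect the exit status that Kitchen returns. The sketch below is not from the book. It assumes the exit codes listed in the Pentaho documentation for Kitchen, and the names `KITCHEN_EXIT_CODES`, `describe_exit`, and `run_job` are hypothetical helpers introduced only for illustration.

```python
import subprocess

# Exit codes as described in the Pentaho documentation for Kitchen
# (assumed here; verify against your PDI version).
KITCHEN_EXIT_CODES = {
    0: "the job ran without a problem",
    1: "errors occurred during processing",
    2: "an unexpected error occurred during loading or running of the job",
    7: "the job could not be loaded from XML or the repository",
    8: "error loading steps or plugins",
    9: "command line usage was printed",
}


def describe_exit(code):
    """Translate a Kitchen exit status into a readable message."""
    return KITCHEN_EXIT_CODES.get(code, f"unrecognized exit code {code}")


def run_job(args):
    """Run a Kitchen command given as an argument list.

    Returns a (exit_code, description) tuple.
    """
    result = subprocess.run(args)
    return result.returncode, describe_exit(result.returncode)
```

A scheduler wrapper could call `run_job` with the argument list for a job and raise an alert whenever the returned code is non-zero.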

Approach

This book is filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. It is a practical guide with easy-to-follow recipes that help developers quickly and effectively collect data from disparate sources such as databases, files, and applications, and turn that data into a unified format that is accessible and relevant to end users.

Who this book is written for

This book is for any IT professional working with PDI. It is a valid support both for learning how to use the command-line tools efficiently and for going deeper into particular aspects of those tools to help you work better.
