Pig Design Patterns

Pradeep Pasupuleti

  • 出版商: Packt Publishing
  • 出版日期: 2014-04-14
  • 售價: $2,330
  • 貴賓價: 9.5$2,214
  • 語言: 英文
  • 頁數: 300
  • 裝訂: Paperback
  • ISBN: 1783285559
  • ISBN-13: 9781783285556
  • 相關分類: Design Pattern
  • 下單後立即進貨 (約3~4週)

相關主題

商品描述

Simplify Hadoop programming to create complex end-to-end Enterprise Big Data solutions with Pig

Overview

  • Quickly understand how to use Pig to design end-to-end Big Data systems
  • Implement a hands-on programming approach using design patterns to solve commonly occurring enterprise Big Data challenges
  • Enhances users's capabilities to utilize Pig and create their own design patterns wherever applicable

In Detail

Pig Design Patterns is a comprehensive guide that will enable readers to readily use design patterns that simplify the creation of complex data pipelines in various stages of data management. This book focuses on using Pig in an enterprise context, bridging the gap between theoretical understanding and practical implementation. Each chapter contains a set of design patterns that pose and then solve technical challenges that are relevant to the enterprise use cases.

The book covers the journey of Big Data from the time it enters the enterprise to its eventual use in analytics, in the form of a report or a predictive model. By the end of the book, readers will appreciate Pig's real power in addressing each and every problem encountered when creating an analytics-based data product. Each design pattern comes with a suggested solution, analyzing the trade-offs of implementing the solution in a different way, explaining how the code works, and the results

What you will learn from this book

  • Understand Pig's relevance in an enterprise context
  • Use Pig in design patterns that enable the data movement across platforms during and after analytical processing
  • See how Pig can co-exist with other components of the Hadoop ecosystem to create Big Data solutions using design patterns
  • Simplify the process of creating complex data pipelines using transformations, aggregations, enrichment, cleansing, filtering, reformatting, lookups, and data type conversions
  • Apply the knowledge of Pig in design patterns that deal with integration of Hadoop with other systems to enable multi-platform analytics
  • Comprehend the design patterns and use Pig in cases related to complex analysis of pure structured data

商品描述(中文翻譯)

簡化 Hadoop 程式設計,以使用 Pig 創建複雜的端到端企業大數據解決方案

概述
- 快速了解如何使用 Pig 設計端到端的大數據系統
- 實施以設計模式為基礎的實作編程方法,以解決常見的企業大數據挑戰
- 增強用戶利用 Pig 的能力,並在適用的地方創建自己的設計模式

詳細內容
《Pig 設計模式》是一本全面的指南,將使讀者能夠輕鬆使用設計模式,簡化在各個數據管理階段創建複雜數據管道的過程。本書專注於在企業環境中使用 Pig,彌合理論理解與實際實施之間的差距。每一章都包含一組設計模式,提出並解決與企業用例相關的技術挑戰。

本書涵蓋了大數據從進入企業到最終用於分析(以報告或預測模型的形式)的整個過程。在閱讀完本書後,讀者將能夠欣賞 Pig 在解決創建基於分析的數據產品時所遇到的每一個問題的真正力量。每個設計模式都附有建議的解決方案,分析以不同方式實施該解決方案的權衡,解釋代碼的運作方式及其結果。

你將從本書中學到的內容
- 理解 Pig 在企業環境中的相關性
- 在設計模式中使用 Pig,以便在分析處理期間及之後實現數據在平台之間的流動
- 了解 Pig 如何與 Hadoop 生態系統中的其他組件共存,以使用設計模式創建大數據解決方案
- 簡化使用轉換、聚合、增強、清理、過濾、重新格式化、查找和數據類型轉換創建複雜數據管道的過程
- 應用 Pig 的知識於處理與 Hadoop 與其他系統整合的設計模式,以實現多平台分析
- 理解設計模式並在與純結構化數據的複雜分析相關的案例中使用 Pig