Learning Cascading
暫譯: 學習 Cascading
Michael Covert, Victoria Loewengart
- 出版商: Packt Publishing
- 出版日期: 2015-05-30
- 售價: $2,010
- 貴賓價: 9.5 折 $1,910
- 語言: 英文
- 頁數: 281
- 裝訂: Paperback
- ISBN: 1785288911
- ISBN-13: 9781785288913
海外代購書籍(需單獨結帳)
商品描述
Build reliable, robust, and high-performance big data applications using the Cascading application development efficiently
About This Book
- Understand how Cascading fits into the big data landscape and hides the complexity of MapReduce to enable the development of streamlined, maintainable, and concise applications
- Develop a real-life Cascading application that can be easily customized for your specific needs
- Learn basic and advanced features of Cascading through a practical, hands-on approach with step-by-step instructions and code samples
Who This Book Is For
This book is intended for software developers, system architects and analysts, big data project managers, and data scientists who wish to deploy big data solutions using the Cascading framework. You must have a basic understanding of the big data paradigm and should be familiar with Java development techniques.
What You Will Learn
- Familiarize yourself with tuples, pipes, taps, and flows and build your first Cascading application
- Discover how to design, develop, and use custom operations
- Design, develop, use, and reuse code with subassemblies and Cascades
- Acquire the skills you need to integrate Cascading with external systems
- Gain expertise in testing, QA, and performance tuning to run an efficient and successful Cascading project
- Explore project management methodologies and steps to develop workable solutions
- Discover the future of big data frameworks and understand how Cascading can help your software to evolve with it
- Uncover sources of additional information and other tools that can make development tasks a lot easier
In Detail
Cascading is open source software that is used to create and execute complex data processing workflows on big data clusters. The book starts by explaining how Cascading relates to core big data technologies such as Hadoop MapReduce. Having instilled an understanding of the technology, the book provides a comprehensive introduction to the Cascading paradigm and its components using code examples. You will not only learn more advanced Cascading features, you will also write code to utilize them. Furthermore, you will gain in-depth knowledge of how to efficiently optimize a Cascading application. To deepen your knowledge and experience with Cascading, you will work through a real-life case study using Natural Language Processing to perform text analysis and search on large volumes of unstructured text. Throughout the book, you will receive expert advice on how to use the portions of the product that are undocumented or have limited documentation. By the end of the book, you will be able to build practical Cascading applications.
商品描述(中文翻譯)
使用 Cascading 應用程式開發高效地構建可靠、穩健且高效能的大數據應用程式
本書簡介
- 了解 Cascading 如何融入大數據領域,並隱藏 MapReduce 的複雜性,以便開發簡化、可維護且簡潔的應用程式
- 開發一個可輕鬆自訂以滿足特定需求的實際 Cascading 應用程式
- 通過實踐、動手操作的方法,學習 Cascading 的基本和進階功能,並提供逐步指導和程式碼範例
本書適合誰閱讀
本書適合希望使用 Cascading 框架部署大數據解決方案的軟體開發人員、系統架構師和分析師、大數據專案經理以及資料科學家。您必須對大數據範式有基本了解,並應熟悉 Java 開發技術。
您將學到什麼
- 熟悉元組、管道、接點和流程,並構建您的第一個 Cascading 應用程式
- 了解如何設計、開發和使用自訂操作
- 設計、開發、使用和重用帶有子組件和 Cascades 的程式碼
- 獲得將 Cascading 與外部系統整合所需的技能
- 在測試、品質保證和效能調整方面獲得專業知識,以運行高效且成功的 Cascading 專案
- 探索專案管理方法論和開發可行解決方案的步驟
- 了解大數據框架的未來,並理解 Cascading 如何幫助您的軟體隨之演進
- 發現額外資訊來源和其他工具,讓開發任務變得更加輕鬆
詳細內容
Cascading 是一款開源軟體,用於在大數據叢集上創建和執行複雜的資料處理工作流程。本書首先解釋 Cascading 與核心大數據技術(如 Hadoop MapReduce)的關係。在建立對技術的理解後,本書提供了對 Cascading 範式及其組件的全面介紹,並使用程式碼範例進行說明。您不僅會學習到更進階的 Cascading 功能,還會編寫程式碼來利用這些功能。此外,您將深入了解如何有效地優化 Cascading 應用程式。為了加深您對 Cascading 的知識和經驗,您將通過一個實際案例研究,使用自然語言處理對大量非結構化文本進行文本分析和搜索。在整本書中,您將獲得專家建議,了解如何使用未記錄或文檔有限的產品部分。在書籍結束時,您將能夠構建實用的 Cascading 應用程式。