Building the Unstructured Data Warehouse (Paperback)
暫譯: 建立非結構化數據倉庫 (平裝本)

Name: Building the Unstructured Data Warehouse (Paperback)
Price: 1411 TWD
Availability: InStock
Author: W.H. Inmon, Krish Krishnan
ISBN: 1935504045

W.H. Inmon, Krish Krishnan

出版商: Technics Publication
出版日期: 2011-01-15
售價: $1,485
貴賓價: 9.5 折 $1,411
語言: 英文
頁數: 216
裝訂: Paperback
ISBN: 1935504045
ISBN-13: 9781935504047
相關分類: 大數據 Big-data、資料庫、Data Science

立即出貨 (庫存=1)

買這商品的人也買了...

$1,140

Effective Java, 2/e (Paperback)
~~$420~~ $332

樂高機器人遊樂園篇─LEGO MINDSTORMS NXT 組裝及圖形化程式
$1,290

Game Engine Architecture (Hardcover)
~~$820~~ $648

鳥哥的 Linux 私房菜－基礎學習篇, 3/e
~~$850~~ $723

前進 Android Market！Google Android SDK 實戰演練
~~$480~~ $384

AWS 雲端企業實戰聖經─ Amazon Web Services 改造企業 IT 體質
~~$950~~ $808

Google Android SDK 開發範例大全, 3/e
~~$680~~ $578

王者歸來－C# 4.0 權威指南
~~$490~~ $382

深入淺出 Android 系統移植與開發測試
~~$390~~ $304

培養與鍛鍊程式設計的邏輯腦：世界級程式設計大賽的知識、心得與解題分享
~~$580~~ $458

探索 iPhone 4 程式開發實戰 (Beginning iPhone 4 Development: Exploring the iOS SDK)
~~$880~~ $695

網頁介面設計模式 (Designing Web Interfaces: Principles and Patterns for Rich Interactions)
~~$780~~ $663

徹底研究 Java 程式開發 349 例
~~$580~~ $458

HTML & CSS : 網站設計建置優化之道 (HTML and CSS: Design and Build Websites)
~~$590~~ $460

求職加分！進入 IT 產業必讀的 200 個 .NET 面試決勝題：從求職準備、面試流程、開發心得、重點回顧到經典試題的完整剖析
~~$580~~ $452

無瑕的程式碼－敏捷軟體開發技巧守則 (Clean Code: A Handbook of Agile Software Craftsmanship)
~~$1,130~~ $961

超圖解 Arduino 互動設計入門 (附 Arduino UNO R3 開發板)
~~$400~~ $380

Arduino UNO R3 開發板(副廠相容版)附傳輸線
~~$2,200~~ $2,090

Cloud Computing: Concepts, Technology & Architecture (Hardcover)
~~$550~~ $429

10 天就懂！一定學會 jQuery 的 36 堂關鍵課程
~~$480~~ $379

iOS 7 程式設計實戰－171 個快速上手的開發技巧
~~$320~~ $250

Google 雲端工作術－提升工作效能的 160 個實用技巧
~~$249~~ $212

Microsoft Word 2013 超 EASY !
~~$690~~ $538

Android 大螢幕手機與平板電腦開發實戰：經典範例直擊大螢幕、高解析度的核心處理技術
~~$380~~ $300

實戰 Cacti 網路監控系統－打造高可用性 IT 環境的最佳幫手

商品描述

Learn essential techniques from data warehouse legend Bill Inmon on how to build the reporting environment your business needs now!

Answers for many valuable business questions hide in text. How well can your existing reporting environment extract the necessary text from email, spreadsheets, and documents, and put it in a useful format for analytics and reporting? Transforming the traditional data warehouse into an efficient unstructured data warehouse requires additional skills from the analyst, architect, designer, and developer. This book will prepare you to successfully implement an unstructured data warehouse and, through clear explanations, examples, and case studies, you will learn new techniques and tips to successfully obtain and analyze text.

Master these ten objectives:

Build an unstructured data warehouse using the 11-step approach
Integrate text and describe it in terms of homogeneity, relevance, medium, volume, and structure
Overcome challenges including blather, the Tower of Babel, and lack of natural relationships
Avoid the Data Junkyard and combat the Spider's Web
Reuse techniques perfected in the traditional data warehouse and Data Warehouse 2.0,including iterative development
Apply essential techniques for textual Extract, Transform, and Load (ETL) such as phrase recognition, stop word filtering, and synonym replacement
Design the Document Inventory system and link unstructured text to structured data
Leverage indexes for efficient text analysis and taxonomies for useful external categorization
Manage large volumes of data using advanced techniques such as backward pointers
Evaluate technology choices suitable for unstructured data processing, such as data warehouse appliances

The following outline briefly describes each chapter's content:

Chapter 1 defines unstructured data and explains why text is the main focus of this book.
Chapter 2 addresses the challenges one faces when managing unstructured data.
Chapter 3 discusses the DW 2.0 architecture, which leads into the role of the unstructured data warehouse. The unstructured data warehouse is defined and benefits are given. There are several features of the conventional data warehouse that can be leveraged for the unstructured data warehouse, including ETL processing, textual integration, and iterative development.
Chapter 4 focuses on the heart of the unstructured data warehouse: Textual Extract, Transform, and Load (ETL).
Chapter 5 describes the 11 steps required to develop the unstructured data warehouse.
Chapter 6 describes how to inventory documents for maximum analysis value, as well as link the unstructured text to structured data for even greater value.
Chapter 7 goes through each of the different types of indexes necessary to make text analysis efficient. Indexes range from simple indexes, which are fast to create and are good if the analyst really knows what needs to be analyzed before the indexing process begins, to complex combined indexes, which can be made up of any and all of the other kinds of indexes.
Chapter 8 explains taxonomies and how they can be used within the unstructured data warehouse.
Chapter 9 explains ways of coping with large amounts of unstructured data. Techniques such as keeping the unstructured data at its source and using backward pointers are discussed. The chapter explains why iterative development is so important.
Chapter 10 focuses on challenges and some technology choices that are suitable for unstructured data processing. In addition, the data warehouse appliance is discussed.
Chapters 11, 12, and 13 put all of the previously discussed techniques and approaches in context through three case studies.

商品描述(中文翻譯)

從數據倉儲傳奇人物 Bill Inmon 學習如何建立您當前業務所需的報告環境的基本技術！

許多有價值的商業問題的答案隱藏在文本中。您現有的報告環境能多好地從電子郵件、電子表格和文件中提取必要的文本，並將其轉換為有用的分析和報告格式？將傳統數據倉儲轉變為高效的非結構化數據倉儲需要分析師、架構師、設計師和開發人員額外的技能。本書將幫助您成功實施非結構化數據倉儲，通過清晰的解釋、範例和案例研究，您將學習到成功獲取和分析文本的新技術和技巧。

掌握這十個目標：