The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data
暫譯: 文本挖掘手冊:分析非結構化數據的進階方法

Ronen Feldman, James Sanger

  • 出版商: Cambridge
  • 出版日期: 2006-12-11
  • 售價: $1,400
  • 貴賓價: 9.8$1,372
  • 語言: 英文
  • 頁數: 424
  • 裝訂: Hardcover
  • ISBN: 0521836573
  • ISBN-13: 9780521836579
  • 相關分類: Text-mining
  • 無法訂購

買這商品的人也買了...

商品描述

Description 

Text mining is a new and exciting area of computer science research that tries to solve the crisis of information overload by combining techniques from data mining, machine learning, natural language processing, information retrieval, and knowledge management. Similarly, link detection – a rapidly evolving approach to the analysis of text that shares and builds upon many of the key elements of text mining – also provides new tools for people to better leverage their burgeoning textual data resources. The Text Mining Handbook presents a comprehensive discussion of the state-of-the-art in text mining and link detection. In addition to providing an in-depth examination of core text mining and link detection algorithms and operations, the book examines advanced pre-processing techniques, knowledge representation considerations, and visualization approaches. Finally, the book explores current real-world, mission-critical applications of text mining and link detection in such varied fields as M&A business intelligence, genomics research and counter-terrorism activities.

• The first comprehensive compilation of algorithms, methodologies, practical approaches and applications

 • Co-authored by one of the founding figures in the field of text mining

 • Detailed description of core text mining algorithms for identifying patterns such as frequent sets, distributions and proportions and associations


Table of Contents

1. Introduction to text mining; 2. Core text mining operations; 3. Text mining preprocessing techniques; 4. Categorization; 5. Clustering; 6. Information extraction; 7. Probabilistic models for Information extraction; 8. Preprocessing applications using probabilistic and hybrid approaches; 9. Presentation-layer considerations for browsing and query refinement; 10. Visualization approaches; 11. Link analysis; 12. Text mining applications; Appendix; Bibliography.

商品描述(中文翻譯)

**描述**
文本挖掘是一個新穎且令人興奮的計算機科學研究領域,旨在通過結合數據挖掘、機器學習、自然語言處理、信息檢索和知識管理的技術來解決信息過載的危機。同樣,鏈接檢測——一種快速發展的文本分析方法,與文本挖掘的許多關鍵要素相互關聯並建立在其基礎上——也為人們更好地利用日益增長的文本數據資源提供了新工具。《文本挖掘手冊》全面討論了文本挖掘和鏈接檢測的最新技術。除了深入檢視核心文本挖掘和鏈接檢測算法及操作外,本書還探討了先進的預處理技術、知識表示考量和可視化方法。最後,本書探索了文本挖掘和鏈接檢測在併購商業情報、基因組研究和反恐活動等多個領域的當前現實世界、任務關鍵應用。

• 首部全面匯編的算法、方法論、實用方法和應用
• 由文本挖掘領域的創始人物之一共同撰寫
• 詳細描述識別模式的核心文本挖掘算法,如頻繁集、分佈和比例及關聯

**目錄**
1. 文本挖掘簡介;2. 核心文本挖掘操作;3. 文本挖掘預處理技術;4. 分類;5. 聚類;6. 信息提取;7. 信息提取的概率模型;8. 使用概率和混合方法的預處理應用;9. 瀏覽和查詢精煉的展示層考量;10. 可視化方法;11. 鏈接分析;12. 文本挖掘應用;附錄;參考文獻。