Text Mining Application Programming (Paperback)
暫譯: 文本挖掘應用程式設計
Manu Konchady
- 出版商: Charles River Media
- 出版日期: 2006-05-04
- 售價: $2,100
- 貴賓價: 9.5 折 $1,995
- 語言: 英文
- 頁數: 432
- 裝訂: Paperback
- ISBN: 1584504609
- ISBN-13: 9781584504603
-
相關分類:
Text-mining
立即出貨(限量) (庫存=1)
買這商品的人也買了...
-
$480$408 -
$880$695 -
$860$731 -
$880$695 -
$780$663 -
$650$507 -
$680$578 -
$550$468 -
$980$774 -
$750$593 -
$880$695 -
$750$638 -
$680$578 -
$450$383 -
$780$616 -
$720$612 -
$1,200$948 -
$1,400$1,372 -
$990$891 -
$580$452 -
$600$480 -
$1,440Handbook of Digital Forensics and Investigation (Paperback)
-
$1,700$1,615 -
$1,890$1,796 -
$1,683Digital Forensics with Open Source Tools (Paperback)
相關主題
商品描述
Description
Text Mining Application Programming teaches software developers how to mine the vast amounts of information available on the Web, internal networks, and desktop files and turn it into usable data. The book helps developers understand the problems associated with managing unstructured text, and explains how to build your own mining tools using standard statistical methods from Information Theory, Artificial Intelligence, and Operations Research. Each of the topics covered are thoroughly explained and then a practical implementation is provided.
The book begins with a brief overview of text data, where it can be found, and the typical search engines and tools used to search and gather this text. It details how to build tools for extracting and using the text, and covers the mathematics behind many of the algorithms used in building these tools. From there you’ll learn how to build tokens from text, construct indexes, and detect patterns in text. You’ll also find methods to extract the names of people, places, and organizations from an email, a news article, or a web page. The next portion of the book teaches you how to find information on the Web, the structure of the Web, and building spiders to crawl the Web. Text categorization is also described in the context of managing email. The final part of the book covers information monitoring, summarization, and a simple Question & Answer (Q&A) system. The code used in the book is written in Perl, but knowledge of Perl is not necessary to run the software. Developers with an intermediate level of experience with Perl can customize the software. Although the book is about programming, methods are explained with English-like pseudocode and the source code is provided on the CD-ROM.
After reading this book you’ll be ready to tap into the bevy of information available online in ways you never thought possible.
Features
- Teaches developers how to build text mining applications to manage vast amounts of text and turn it into useful data
- Covers key topics such as information extraction, clustering, building spiders, text categorization, summarization, and natural language query systems
- Shows step-by-step techniques for implementing text mining solutions, and provides customizable solutions
商品描述(中文翻譯)
**描述**
《文本挖掘應用程式設計》教導軟體開發人員如何從網路、內部網路和桌面檔案中挖掘大量可用的資訊,並將其轉化為可用的數據。本書幫助開發人員理解管理非結構化文本所面臨的問題,並解釋如何使用資訊理論、人工智慧和運籌學中的標準統計方法來構建自己的挖掘工具。每個主題都進行了詳細的解釋,並提供了實際的實作範例。
本書首先簡要概述了文本數據、其來源以及用於搜尋和收集這些文本的典型搜尋引擎和工具。接著詳細說明如何構建提取和使用文本的工具,並涵蓋了許多用於構建這些工具的算法背後的數學知識。然後,您將學習如何從文本中構建標記、構建索引以及檢測文本中的模式。您還會找到從電子郵件、新聞文章或網頁中提取人名、地名和組織名稱的方法。本書的下一部分教您如何在網路上尋找資訊、網路的結構,以及構建爬蟲來爬取網路。文本分類也在管理電子郵件的背景下進行了描述。本書的最後部分涵蓋了資訊監控、摘要以及一個簡單的問答系統。本書中使用的程式碼是用 Perl 編寫的,但運行軟體並不需要 Perl 知識。具有中級 Perl 經驗的開發人員可以自定義該軟體。雖然本書是關於程式設計,但方法是用類似英語的偽代碼進行解釋,並且源代碼隨附在 CD-ROM 中。
閱讀完本書後,您將準備好以您從未想過的方式利用網上豐富的資訊。
**特點**
- 教導開發人員如何構建文本挖掘應用程式,以管理大量文本並將其轉化為有用的數據
- 涵蓋關鍵主題,如資訊提取、聚類、構建爬蟲、文本分類、摘要和自然語言查詢系統
- 展示逐步實施文本挖掘解決方案的技術,並提供可自定義的解決方案