Fuzzy Information Retrieval (Synthesis Lectures on Information Concepts, Retrieval, and Services)
暫譯: 模糊資訊檢索(資訊概念、檢索與服務綜合講座)

Donald H. Kraft, Erin Colvin

  • 出版商: Morgan & Claypool
  • 出版日期: 2017-01-23
  • 售價: $1,460
  • 貴賓價: 9.5$1,387
  • 語言: 英文
  • 頁數: 82
  • 裝訂: Paperback
  • ISBN: 1627059520
  • ISBN-13: 9781627059527
  • 海外代購書籍(需單獨結帳)

商品描述

Information retrieval used to mean looking through thousands of strings of texts to find words or symbols that matched a user's query. Today, there are many models that help index and search more effectively so retrieval takes a lot less time. Information retrieval (IR) is often seen as a subfield of computer science and shares some modeling, applications, storage applications and techniques, as do other disciplines like artificial intelligence, database management, and parallel computing. This book introduces the topic of IR and how it differs from other computer science disciplines. A discussion of the history of modern IR is briefly presented, and the notation of IR as used in this book is defined. The complex notation of relevance is discussed. Some applications of IR is noted as well since IR has many practical uses today. Using information retrieval with fuzzy logic to search for software terms can help find software components and ultimately help increase the reuse of software. This is just one practical application of IR that is covered in this book.

Some of the classical models of IR is presented as a contrast to extending the Boolean model. This includes a brief mention of the source of weights for the various models. In a typical retrieval environment, answers are either yes or no, i.e., on or off. On the other hand, fuzzy logic can bring in a "degree of" match, vs. a crisp, i.e., strict match. This, too, is looked at and explored in much detail, showing how it can be applied to information retrieval. Fuzzy logic is often times considered a soft computing application and this book explores how IR with fuzzy logic and its membership functions as weights can help indexing, querying, and matching. Since fuzzy set theory and logic is explored in IR systems, the explanation of where the fuzz is ensues.

The concept of relevance feedback, including pseudorelevance feedback is explored for the various models of IR. For the extended Boolean model, the use of genetic algorithms for relevance feedback is delved into.

The concept of query expansion is explored using rough set theory. Various term relationships is modeled and presented, and the model extended for fuzzy retrieval. An example using the UMLS terms is also presented. The model is also extended for term relationships beyond synonyms.

Finally, this book looks at clustering, both crisp and fuzzy, to see how that can improve retrieval performance. An example is presented to illustrate the concepts.

商品描述(中文翻譯)

資訊檢索(Information Retrieval, IR)過去是指透過搜尋數千串文本來找到符合使用者查詢的單字或符號。如今,有許多模型可以更有效地進行索引和搜尋,因此檢索所需的時間大幅減少。資訊檢索通常被視為計算機科學的一個子領域,並與其他學科如人工智慧、資料庫管理和平行計算共享一些建模、應用、儲存應用和技術。本書介紹了資訊檢索的主題以及它與其他計算機科學學科的不同之處。簡要介紹了現代資訊檢索的歷史,並定義了本書中使用的資訊檢索符號。討論了相關性的複雜符號。由於資訊檢索在今天有許多實際應用,因此也提到了一些資訊檢索的應用。使用模糊邏輯進行資訊檢索以搜尋軟體術語可以幫助找到軟體元件,最終有助於增加軟體的重用性。這只是本書中涵蓋的資訊檢索的一個實際應用。

本書介紹了一些資訊檢索的經典模型,以對比擴展布林模型。這包括對各種模型的權重來源的簡要提及。在典型的檢索環境中,答案要麼是「是」要麼是「否」,即開或關。另一方面,模糊邏輯可以引入「匹配的程度」,與嚴格的匹配相對。這一點也被詳細探討,顯示它如何應用於資訊檢索。模糊邏輯通常被視為一種軟計算應用,本書探討了如何利用模糊邏輯及其作為權重的隸屬函數來幫助索引、查詢和匹配。由於模糊集合理論和邏輯在資訊檢索系統中被探討,因此也解釋了模糊的來源。

本書探討了相關性反饋的概念,包括偽相關性反饋,針對各種資訊檢索模型進行了探討。對於擴展布林模型,深入探討了使用遺傳演算法進行相關性反饋的應用。

本書使用粗集理論探討查詢擴展的概念。建模並呈現了各種術語關係,並擴展模型以進行模糊檢索。還提供了一個使用UMLS術語的範例。該模型也擴展到超越同義詞的術語關係。

最後,本書探討了聚類,包括嚴格聚類和模糊聚類,以了解這如何改善檢索性能。提供了一個範例來說明這些概念。

最後瀏覽商品 (20)