Semantic Relations Between Nominals (Synthesis Lectures on Human Language Technologies)
暫譯: 名詞之間的語意關係(人類語言技術綜合講座)
Vivi Nastase, Preslav Nakov, Diarmuid Ó Séaghdha, Stan Szpakowicz
- 出版商: Morgan & Claypool
- 出版日期: 2013-04-01
- 售價: $1,620
- 貴賓價: 9.5 折 $1,539
- 語言: 英文
- 頁數: 120
- 裝訂: Paperback
- ISBN: 1608459799
- ISBN-13: 9781608459797
海外代購書籍(需單獨結帳)
商品描述
People make sense of a text by identifying the semantic relations which connect the entities or concepts described by that text. A system which aspires to human-like performance must also be equipped to identify, and learn from, semantic relations in the texts it processes. Understanding even a simple sentence such as "Opportunity and Curiosity find similar rocks on Mars" requires recognizing relations (rocks are located on Mars, signalled by the word on) and drawing on already known relations (Opportunity and Curiosity are instances of the class of Mars rovers). A language-understanding system should be able to find such relations in documents and progressively build a knowledge base or even an ontology. Resources of this kind assist continuous learning and other advanced language-processing tasks such as text summarization, question answering and machine translation. The book discusses the recognition in text of semantic relations which capture interactions between base noun phrases. After a brief historical background, we introduce a range of relation inventories of varying granularity, which have been proposed by computational linguists. There is also variation in the scale at which systems operate, from snippets all the way to the whole Web, and in the techniques of recognizing relations in texts, from full supervision through weak or distant supervision to self-supervised or completely unsupervised methods. A discussion of supervised learning covers available datasets, feature sets which describe relation instances, and successful algorithms. An overview of weakly supervised and unsupervised learning zooms in on the acquisition of relations from large corpora with hardly any annotated data. We show how bootstrapping from seed examples or patterns scales up to very large text collections on the Web. We also present machine learning techniques in which data redundancy and variability lead to fast and reliable relation extraction. Table of Contents: Introduction / Relations between Nominals, Relations between Concepts / Extracting Semantic Relations with Supervision / Extracting Semantic Relations with Little or No Supervision / Conclusion
商品描述(中文翻譯)
人們通過識別文本中連接實體或概念的語義關係來理解文本。一個渴望達到人類表現的系統也必須具備識別和學習其處理文本中的語義關係的能力。理解即使是簡單的句子,例如「Opportunity 和 Curiosity 在火星上找到相似的岩石」,需要識別關係(岩石位於火星上,由單詞 on 表示)並利用已知的關係(Opportunity 和 Curiosity 是火星探測器的實例)。一個語言理解系統應能夠在文檔中找到這些關係,並逐步建立知識庫或甚至本體。這類資源有助於持續學習以及其他高級語言處理任務,如文本摘要、問答和機器翻譯。本書討論了文本中語義關係的識別,這些關係捕捉了基本名詞短語之間的互動。在簡要的歷史背景介紹後,我們介紹了一系列由計算語言學家提出的不同粒度的關係庫。系統運作的規模也有所不同,從片段到整個網絡,以及識別文本中關係的技術,從完全監督到弱監督或遠程監督,再到自我監督或完全無監督的方法。對監督學習的討論涵蓋了可用數據集、描述關係實例的特徵集和成功的算法。對弱監督和無監督學習的概述聚焦於從幾乎沒有標註數據的大型語料庫中獲取關係。我們展示了如何從種子示例或模式進行自舉,擴展到網絡上非常大的文本集合。我們還介紹了機器學習技術,其中數據冗餘和變異性導致快速且可靠的關係提取。
目錄:引言 / 名詞之間的關係、概念之間的關係 / 使用監督提取語義關係 / 使用少量或無監督提取語義關係 / 結論