Entity Resolution in the Web of Data (Synthesis Lectures on the Semantic Web: Theory and Technology)
暫譯: 數據網絡中的實體解析(語義網:理論與技術綜合講座)
Vassilis Christophides, Vasilis Efthymiou, Kostas Stefanidis
- 出版商: Morgan & Claypool
- 出版日期: 2015-08-01
- 售價: $1,620
- 貴賓價: 9.5 折 $1,539
- 語言: 英文
- 頁數: 124
- 裝訂: Paperback
- ISBN: 1627058036
- ISBN-13: 9781627058032
海外代購書籍(需單獨結帳)
相關主題
商品描述
In recent years, several knowledge bases have been built to enable large-scale knowledge sharing, but also an entity-centric Web search, mixing both structured data and text querying. These knowledge bases offer machine-readable descriptions of real-world entities, e.g., persons, places, published on the Web as Linked Data. However, due to the different information extraction tools and curation policies employed by knowledge bases, multiple, complementary and sometimes conflicting descriptions of the same real-world entities may be provided. Entity resolution aims to identify different descriptions that refer to the same entity appearing either within or across knowledge bases. The objective of this book is to present the new entity resolution challenges stemming from the openness of the Web of data in describing entities by an unbounded number of knowledge bases, the semantic and structural diversity of the descriptions provided across domains even for the same real-world entities, as well as the autonomy of knowledge bases in terms of adopted processes for creating and curating entity descriptions. The scale, diversity, and graph structuring of entity descriptions in the Web of data essentially challenge how two descriptions can be effectively compared for similarity, but also how resolution algorithms can efficiently avoid examining pairwise all descriptions. The book covers a wide spectrum of entity resolution issues at the Web scale, including basic concepts and data structures, main resolution tasks and workflows, as well as state-of-the-art algorithmic techniques and experimental trade-offs.
商品描述(中文翻譯)
近年來,已建立幾個知識庫以促進大規模的知識共享,同時也實現以實體為中心的網路搜尋,結合結構化數據和文本查詢。這些知識庫提供了可機器讀取的現實世界實體的描述,例如,個人、地點,並以連結數據的形式發佈在網路上。然而,由於知識庫所使用的不同信息提取工具和策展政策,可能會提供多個互補且有時相互矛盾的相同現實世界實體的描述。實體解析旨在識別不同的描述,這些描述指向同一實體,無論是在知識庫內部還是跨知識庫之間。本書的目標是呈現由於數據網路的開放性而產生的新實體解析挑戰,這些挑戰源於無限數量的知識庫對實體的描述、即使是相同現實世界實體所提供的描述在語義和結構上的多樣性,以及知識庫在創建和策劃實體描述過程中所採用的自主性。數據網路中實體描述的規模、多樣性和圖形結構本質上挑戰了如何有效比較兩個描述的相似性,也挑戰了解析算法如何高效地避免逐對檢查所有描述。本書涵蓋了網路規模下的廣泛實體解析問題,包括基本概念和數據結構、主要解析任務和工作流程,以及最先進的算法技術和實驗權衡。