Multiword Expressions Acquisition: A Generic and Open Framework (Theory and Applications of Natural Language Processing)

Carlos Ramisch

  • 出版商: Springer
  • 出版日期: 2014-10-08
  • 售價: $4,430
  • 貴賓價: 9.5$4,209
  • 語言: 英文
  • 頁數: 230
  • 裝訂: Hardcover
  • ISBN: 3319092065
  • ISBN-13: 9783319092065
  • 相關分類: Word
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

​This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this exciting topic in computational linguistics. The first part describes the diversity and richness of multiword expressions, including many examples in several languages. These constructions are not only complex and arbitrary, but also much more frequent than one would guess, making them a real nightmare for natural language processing applications. 

The second part introduces a new generic framework for automatic acquisition of multiword expressions from texts. Furthermore, it describes the accompanying free software tool, the mwetoolkit, which comes in handy when looking for expressions in texts (regardless of the language). Evaluation is greatly emphasized, underlining the fact that results depend on parameters like corpus size, language, MWE type, etc. The last part contains solid experimental results and evaluates the mwetoolkit, demonstrating its usefulness for computer-assisted lexicography and machine translation.

This is the first book to cover the whole pipeline of multiword expression acquisition in a single volume. It is addresses the needs of students and researchers in computational and theoretical linguistics, cognitive sciences, artificial intelligence and computer science. Its good balance between computational and linguistic views make it the perfect starting point for anyone interested in multiword expressions, language and text processing in general.

商品描述(中文翻譯)

這本書是多詞表達的優秀入門書籍。它提供了一個獨特、全面且最新的概述,涵蓋了計算語言學中這個令人興奮的主題。第一部分描述了多詞表達的多樣性和豐富性,並包含了多種語言的許多例子。這些結構不僅複雜且隨意,而且出現的頻率遠高於人們的猜測,這使得它們對自然語言處理應用來說是一個真正的噩夢。

第二部分介紹了一個新的通用框架,用於從文本中自動獲取多詞表達。此外,它還描述了隨附的免費軟體工具 mwetoolkit,該工具在尋找文本中的表達時非常有用(無論語言為何)。評估被大大強調,突顯了結果依賴於語料庫大小、語言、多詞表達類型等參數的事實。最後一部分包含了穩固的實驗結果,並評估了 mwetoolkit,展示了其在計算輔助詞典編纂和機器翻譯中的實用性。

這是第一本在單一卷中涵蓋多詞表達獲取整個流程的書籍。它滿足了計算語言學、理論語言學、認知科學、人工智慧和計算機科學領域學生和研究者的需求。其在計算和語言學觀點之間的良好平衡,使其成為任何對多詞表達、語言和文本處理感興趣的人的完美起點。