Similar Languages, Varieties, and Dialects: A Computational Perspective
Zampieri, Marcos; Nakov, Preslav
Product description
Language resources and computational models are becoming increasingly important for the study of language variation. A main challenge of this interdisciplinary field is that linguistics researchers may not be familiar with the relevant computational tools, while many NLP researchers are not familiar with language variation phenomena. This reference introduces researchers to the computational models needed for processing similar languages, varieties, and dialects. In this book, leading experts address the inherent challenges of the field by balancing a thorough discussion of the theoretical background with an overview of state-of-the-art language technology. The book can be used in a graduate course, or as a supplementary text for courses on language variation, dialectology, and sociolinguistics, or on computational linguistics and NLP. Part 1 covers the linguistic fundamentals of the field, such as the question of status and language variation. Part 2 discusses data collection and pre-processing methods. Finally, Part 3 presents NLP applications such as speech processing and machine translation, along with language-specific issues in Arabic and Chinese.