Automatic Speech Recognition and Translation for Low Resource Languages
Kumar, L. Ashok, Renuka, D. Karthika, Chakravarthi, Bharathi Raja
- 出版商: Wiley-Scrivener
- 出版日期: 2024-04-30
- 售價: $7,700
- 貴賓價: 9.5 折 $7,315
- 語言: 英文
- 頁數: 496
- 裝訂: Hardcover - also called cloth, retail trade, or trade
- ISBN: 1394213581
- ISBN-13: 9781394213580
-
相關分類:
語音辨識 Speech-recognition
無法訂購
相關主題
商品描述
This book is a comprehensive exploration into the cutting-edge research, methodologies, and advancements in addressing the unique challenges associated with ASR and translation for low-resource languages.
Automatic Speech Recognition and Translation for Low Resource Languages contains groundbreaking research from experts and researchers sharing innovative solutions that address language challenges in low-resource environments. The book begins by delving into the fundamental concepts of ASR and translation, providing readers with a solid foundation for understanding the subsequent chapters. It then explores the intricacies of low-resource languages, analyzing the factors that contribute to their challenges and the significance of developing tailored solutions to overcome them.
The chapters encompass a wide range of topics, ranging from both the theoretical and practical aspects of ASR and translation for low-resource languages. The book discusses data augmentation techniques, transfer learning, and multilingual training approaches that leverage the power of existing linguistic resources to improve accuracy and performance. Additionally, it investigates the possibilities offered by unsupervised and semi-supervised learning, as well as the benefits of active learning and crowdsourcing in enriching the training data. Throughout the book, emphasis is placed on the importance of considering the cultural and linguistic context of low-resource languages, recognizing the unique nuances and intricacies that influence accurate ASR and translation. Furthermore, the book explores the potential impact of these technologies in various domains, such as healthcare, education, and commerce, empowering individuals and communities by breaking down language barriers.
Audience
The book targets researchers and professionals in the fields of natural language processing, computational linguistics, and speech technology. It will also be of interest to engineers, linguists, and individuals in industries and organizations working on cross-lingual communication, accessibility, and global connectivity.
商品描述(中文翻譯)
自動語音辨識與低資源語言翻譯
本書全面探討了在解決低資源語言的自動語音辨識(ASR)和翻譯所面臨的獨特挑戰方面的前沿研究、方法論和進展。
《自動語音辨識與低資源語言翻譯》包含了來自專家和研究人員的開創性研究,分享了針對低資源環境中語言挑戰的創新解決方案。本書首先深入探討ASR和翻譯的基本概念,為讀者提供了理解後續章節的堅實基礎。接著,書中探討了低資源語言的複雜性,分析了導致其挑戰的因素,以及開發量身定制解決方案以克服這些挑戰的重要性。
各章涵蓋了廣泛的主題,涉及低資源語言的ASR和翻譯的理論和實踐方面。本書討論了數據增強技術、遷移學習和多語言訓練方法,利用現有語言資源的力量來提高準確性和性能。此外,還探討了無監督和半監督學習所提供的可能性,以及主動學習和眾包在豐富訓練數據方面的好處。在整本書中,強調了考慮低資源語言的文化和語言背景的重要性,認識到影響準確ASR和翻譯的獨特細微差別和複雜性。此外,本書還探討了這些技術在各個領域的潛在影響,如醫療、教育和商業,通過打破語言障礙來賦能個人和社區。
讀者對象
本書的目標讀者是自然語言處理、計算語言學和語音技術領域的研究人員和專業人士。對於從事跨語言交流、可及性和全球連接的工程師、語言學家以及相關行業和組織的個人也將感興趣。
作者簡介
L. Ashok Kumar, PhD, is a professor in the Department of Electrical and Electronics Engineering, PSG of Technology, Tamil Nadu, India. He has published more than 175 papers in international and national journals and received 26 awards for his PhD project on wearable electronics at national and international levels. He has created eight Centres of Excellence at PSG in collaboration with government agencies and industries such as the Centre for Audio Visual Speech Recognition and the Centre for Excellence in Solar Thermal Systems. Twenty-three out of 27 of his products have been technologically transferred to government funding agencies.
D. Karthika Renuka, PhD, is a professor at PSG of Technology, Tamil Nadu, India. Her main areas of study focus on data mining, evolutionary algorithms, and machine learning. She is a recipient of the Indo-U.S. Fellowship for Women in STEMM. She has organized two international conferences on The Innovation of Computing Techniques and Information Processing and Remote Computing.
Bharathi Raja Chakravarthi, PhD, is an assistant professor in the School of Computer Science, University of Galway, Ireland. His studies focus on multimodal machine learning, abusive/offensive language detection, bias in natural language processing tasks, inclusive language detection, and multilingualism. He has published many papers in international journals and conferences. He is an associate editor of the journal Expert System with Application and an editorial board member for Computer Speech & Language.
Thomas Mandl, PhD, is a professor of Information Science and Language Technology, University of Hildesheim, Germany. His research interests include information retrieval, human-computer interaction, and internationalization of information technology and he has published more than 300 papers on these topics. He coordinated tracks at the Cross Language Evaluation Forum (CLEF), the European information retrieval evaluation initiative. Thomas Mandl is the co-chair at FIRE, the evaluation initiative for Indian languages, since 2020 and coordinates the HASOC track on hate speech detection.
作者簡介(中文翻譯)
L. Ashok Kumar, PhD,是印度泰米爾納德邦PSG科技大學電機與電子工程系的教授。他在國際和國內期刊上發表了超過175篇論文,並因其在可穿戴電子產品方面的博士專案獲得了26項國內外獎項。他與政府機構及產業合作,在PSG創建了八個卓越中心,如視聽語音識別中心和太陽熱系統卓越中心。他的27項產品中有23項已經技術轉移給政府資助機構。
D. Karthika Renuka, PhD,是印度泰米爾納德邦PSG科技大學的教授。她的主要研究領域集中在數據挖掘、演化算法和機器學習。她是印度-美國STEMM女性獎學金的獲得者。她曾組織過兩場國際會議,主題為《計算技術的創新》和《信息處理與遠程計算》。
Bharathi Raja Chakravarthi, PhD,是愛爾蘭高威大學計算機科學學院的助理教授。他的研究集中在多模態機器學習、辱罵/攻擊性語言檢測、自然語言處理任務中的偏見、包容性語言檢測和多語言能力。他在國際期刊和會議上發表了多篇論文。他是期刊《Expert System with Application》的副編輯,並擔任《Computer Speech & Language》的編輯委員會成員。
Thomas Mandl, PhD,是德國希爾德斯海姆大學信息科學與語言技術的教授。他的研究興趣包括信息檢索、人機互動以及信息技術的國際化,並在這些主題上發表了超過300篇論文。他在跨語言評估論壇(CLEF)協調了多個專題,該論壇是歐洲的信息檢索評估倡議。自2020年以來,Thomas Mandl擔任印度語言評估倡議FIRE的共同主席,並協調針對仇恨言論檢測的HASOC專題。