Audio Processing and Speech Recognition: Concepts, Techniques and Research Overviews (SpringerBriefs in Applied Sciences and Technology)

Soumya Sen, Anjan Dutta, Nilanjan Dey

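  • Publisher: Springer
  • Publication date: 2019-02-20
  • List price: $2,420
  • VIP price: $2,299 (5% off)
  • Language: English
  • Pages: 96
  • Binding: Paperback
  • ISBN: 9811360979
  • ISBN-13: 9789811360978
  • Related category: Speech recognition
  • Overseas special-order title (requires separate checkout)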

Product description

This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and the classical information retrieval problem, and presents two major indexing techniques: Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses the various components of an automatic speech recognition (ASR) system.
 
Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification. 
 
By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals to a wide audience, from researchers and postgraduate students to those new to the field.



