Distant Speech Recognition (Hardcover)

Matthias Woelfel, John McDonough

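  • Publisher: Wiley
  • Publication date: 2009-06-01
  • List price: $4,730
  • Member price: $4,494 (5% off)
  • Language: English
  • Pages: 594
  • Binding: Hardcover
  • ISBN: 0470517042
  • ISBN-13: 9780470517048
  • Related categories: Speech recognition
  • Ordered from overseas on request (requires separate checkout)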

Product description

A complete overview of distant automatic speech recognition

 

The performance of conventional automatic speech recognition (ASR) systems degrades dramatically as soon as the microphone is moved away from the speaker's mouth. The causes include background noise, overlapping speech from other speakers, and reverberation. Although traditional ASR systems underperform on speech captured with far-field sensors, a number of novel techniques, both within the recognition system and from other areas of signal processing, can mitigate the deleterious effects of noise and reverberation and separate speech from overlapping speakers.

 

Distant Speech Recognition presents a contemporary and comprehensive description of both the theoretical abstractions and the practical issues inherent in the distant ASR problem.

 

Key Features:

 

  • Covers the entire topic of distant ASR and offers practical solutions to overcome the problems related to it
  • Provides documentation and sample scripts to enable readers to construct state-of-the-art distant speech recognition systems
  • Gives relevant background information in acoustics and filter techniques
  • Explains the extraction and enhancement of classification-relevant speech features
  • Describes maximum likelihood as well as discriminative parameter estimation, and maximum likelihood normalization techniques
  • Discusses the use of multi-microphone configurations for speaker tracking and channel combination
  • Presents several applications of the methods and technologies described in this book
  • Includes an accompanying website with open source software and tools to construct state-of-the-art distant speech recognition systems

 

This reference will be an invaluable resource for researchers, developers, engineers and other professionals, as well as advanced students in the fields of speech technology, signal processing, acoustics, statistics and artificial intelligence.
