Speech Recognition System using MATLAB: Design, Implementation, and Samples Codes (Paperback)
暫譯: 使用 MATLAB 的語音識別系統:設計、實作與範例程式碼 (平裝本)
Ahmad A. M. Abushariah, Teddy Surya Gunawan
- 出版商: LAP LAMBERT
- 出版日期: 2011-09-16
- 售價: $2,130
- 貴賓價: 9.5 折 $2,024
- 語言: 英文
- 頁數: 112
- 裝訂: Paperback
- ISBN: 3846503762
- ISBN-13: 9783846503768
-
相關分類:
Matlab、語音辨識 Speech-recognition
海外代購書籍(需單獨結帳)
買這商品的人也買了...
-
$4,240$4,028 -
$1,980$1,881 -
$1,400$1,330 -
$4,130$3,924 -
$360$281 -
$480$379 -
$580$458 -
$620$484
相關主題
商品描述
Research in automatic speech recognition has been done for almost four decades. This project aims to develop automated English digits speech recognition system using Matlab. The system is able to recognize the spoken utterances by translating the speech waveform into a set of feature vectors using Mel Frequency Cepstral Coefficients (MFCC) technique, which then estimates the observation likelihood by using the Forward algorithm. The Hidden Markov Model (HMM) parameters are estimated by applying the Baum-Welch algorithm on previously trained samples. The most likely sequence is then decoded using Viterbi algorithm, thus producing the recognized word. This project focuses on all English digits from (Zero through Nine), which is based on isolated words structure. Two modules were developed, namely the isolated words speech recognition and the continuous speech recognition. Both modules were tested in both clean and noisy environments and showed relatively sucessful recognition rates. The samples of Matlab codes were provided in the Appendix.
商品描述(中文翻譯)
自動語音辨識的研究已經進行了近四十年。本專案旨在使用 Matlab 開發自動化的英語數字語音辨識系統。該系統能夠通過將語音波形轉換為一組特徵向量來識別所說的話,這一過程使用了梅爾頻率倒譜係數(Mel Frequency Cepstral Coefficients, MFCC)技術,然後利用前向演算法(Forward algorithm)估計觀察的可能性。隱馬可夫模型(Hidden Markov Model, HMM)的參數是通過對先前訓練樣本應用 Baum-Welch 演算法來估計的。然後,使用維特比演算法(Viterbi algorithm)解碼出最可能的序列,從而產生識別出的單詞。本專案專注於所有英語數字(從零到九),基於獨立單詞結構。開發了兩個模組,即獨立單詞語音辨識和連續語音辨識。這兩個模組在乾淨和嘈雜的環境中進行了測試,並顯示出相對成功的識別率。Matlab 代碼的範例已在附錄中提供。