Speech Enhancement in the Karhunen-Loeve Expansion Domain (Synthesis Lectures on Speech and Audio Processing)
暫譯: 卡爾霍寧-洛埃夫展開域中的語音增強(語音與音頻處理綜合講座)

Jacob Benesty, Jingdong Chen, Yiteng Huang

  • 出版商: Morgan & Claypool
  • 出版日期: 2011-01-05
  • 售價: $1,600
  • 貴賓價: 9.5$1,520
  • 語言: 英文
  • 頁數: 112
  • 裝訂: Paperback
  • ISBN: 1608456048
  • ISBN-13: 9781608456048
  • 海外代購書籍(需單獨結帳)

商品描述

This book is devoted to the study of the problem of speech enhancement whose objective is the recovery of a signal of interest (i.e., speech) from noisy observations. Typically, the recovery process is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since both the desired speech and undesired noise are filtered at the same time, the most critical issue of speech enhancement resides in how to design a proper optimal filter that can fully take advantage of the difference between the speech and noise statistics to mitigate the noise effect as much as possible while maintaining the speech perception identical to its original form. The optimal filters can be designed either in the time domain or in a transform space. As the title indicates, this book will focus on developing and analyzing optimal filters in the Karhunen-Loève expansion (KLE) domain. We begin by describing the basic problem of speech enhancement and the fundamental principles to solve it in the time domain. We then explain how the problem can be equivalently formulated in the KLE domain. Next, we divide the general problem in the KLE domain into four groups, depending on whether interframe and interband information is accounted for, leading to four linear models for speech enhancement in the KLE domain. For each model, we introduce signal processing measures to quantify the performance of speech enhancement, discuss the formation of different cost functions, and address the optimization of these cost functions for the derivation of different optimal filters. Both theoretical analysis and experiments will be provided to study the performance of these filters and the links between the KLE-domain and time-domain optimal filters will be examined. Table of Contents: Introduction / Problem Formulation / Optimal Filters in the Time Domain / Linear Models for Signal Enhancement in the KLE Domain / Optimal Filters in the KLE Domain with Model 1 / Optimal Filters in the KLE Domain with Model 2 / Optimal Filters in the KLE Domain with Model 3 / Optimal Filters in the KLE Domain with Model 4 / Experimental Study

商品描述(中文翻譯)

本書專注於語音增強問題的研究,其目標是從噪聲觀測中恢復感興趣的信號(即語音)。通常,恢復過程是通過將噪聲觀測信號通過線性濾波器(或線性變換)來實現的。由於所需的語音和不需要的噪聲同時被濾波,因此語音增強的最關鍵問題在於如何設計一個合適的最佳濾波器,充分利用語音和噪聲統計之間的差異,以盡可能減少噪聲的影響,同時保持語音感知與其原始形式相同。最佳濾波器可以在時間域或變換空間中設計。如標題所示,本書將專注於在 Karhunen-Loève 展開(KLE)域中開發和分析最佳濾波器。我們首先描述語音增強的基本問題及其在時間域中解決的基本原則。然後,我們解釋如何將該問題等效地表述在 KLE 域中。接下來,我們根據是否考慮幀間和頻帶間信息,將 KLE 域中的一般問題分為四組,從而導出四個在 KLE 域中的語音增強線性模型。對於每個模型,我們引入信號處理度量來量化語音增強的性能,討論不同成本函數的形成,並針對這些成本函數的優化進行探討,以推導出不同的最佳濾波器。將提供理論分析和實驗來研究這些濾波器的性能,並檢查 KLE 域與時間域最佳濾波器之間的聯繫。

目錄:引言 / 問題表述 / 時間域中的最佳濾波器 / KLE 域中的信號增強線性模型 / KLE 域中的模型 1 的最佳濾波器 / KLE 域中的模型 2 的最佳濾波器 / KLE 域中的模型 3 的最佳濾波器 / KLE 域中的模型 4 的最佳濾波器 / 實驗研究

最後瀏覽商品 (20)