Audio Source Separation and Speech Enhancement

Publisher: Wiley
Publication date: 2018-10-22
List price: $4,800
Sale price: $4,560 (5% off)
Language: English
Pages: 504
Binding: Hardcover
ISBN: 1119279895
ISBN-13: 9781119279891
Related categories: Machine Learning

Product description

Learn the technology behind hearing aids, Siri, and Echo

Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and play a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software.

Research on this topic has followed three convergent paths, starting from sensor array processing, computational auditory scene analysis, and machine-learning-based approaches such as independent component analysis, respectively. This book is the first to provide a comprehensive overview, presenting the common foundations of these techniques and the differences between them in a unified setting.

Key features:

- Consolidated perspective on audio source separation and speech enhancement.
- Both historical perspective and latest advances in the field, e.g. deep neural networks.
- Diverse disciplines: array processing, machine learning, and statistical signal processing.
- Covers the most important techniques for both single-channel and multichannel processing.

This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) who wish to use audio source separation or speech enhancement as pre-processing tools for their own needs.
