Audio and Speech Processing with MATLAB (Paperback)
暫譯: 使用 MATLAB 進行音頻與語音處理 (平裝本)
Hill, Paul
- 出版商: CRC
- 出版日期: 2020-09-30
- 售價: $2,640
- 貴賓價: 9.5 折 $2,508
- 語言: 英文
- 頁數: 330
- 裝訂: Quality Paper - also called trade paper
- ISBN: 0367656310
- ISBN-13: 9780367656317
-
相關分類:
Matlab
-
其他版本:
Audio and Speech Processing with MATLAB
立即出貨 (庫存=1)
買這商品的人也買了...
-
$480$408 -
$350$298 -
$680$612 -
$350$298 -
$980$882 -
$354$336 -
$560$549 -
$780$764 -
$380$296 -
$534$507 -
$500$390 -
$760神經網絡與深度學習
-
$305語音信號處理 (C++版)
-
$500數字信號處理導論 — MATLAB 實現, 2/e
-
$2,060$1,957 -
$2,100$1,995 -
$356傳感器原理與工程應用, 2/e
-
$714$678 -
$709機電一體化設計導論
-
$352傳感器與檢測技術
-
$1,750$1,715 -
$2,106Natural Language Processing with Transformers, Revised Edition (Paperback)
-
$1,250$1,188 -
$1,625Data Augmentation with Python: Enhance deep learning accuracy with data augmentation methods for image, text, audio, and tabular data (Paperback)
-
$1,520Mastering Transformers : The Journey from BERT to Large Language Models and Stable Diffusion, 2/e (Paperback)
相關主題
商品描述
Speech and audio processing has undergone a revolution in preceding decades that has accelerated in the last few years generating game-changing technologies such as truly successful speech recognition systems; a goal that had remained out of reach until very recently. This book gives the reader a comprehensive overview of such contemporary speech and audio processing techniques with an emphasis on practical implementations and illustrations using MATLAB code. Core concepts are firstly covered giving an introduction to the physics of audio and vibration together with their representations using complex numbers, Z transforms and frequency analysis transforms such as the FFT.
Later chapters give a description of the human auditory system and the fundamentals of psychoacoustics. Insights, results, and analyses given in these chapters are subsequently used as the basis of understanding of the middle section of the book covering: wideband audio compression (MP3 audio etc.), speech recognition and speech coding.
The final chapter covers musical synthesis and applications describing methods such as (and giving MATLAB examples of) AM, FM and ring modulation techniques. This chapter gives a final example of the use of time-frequency modification to implement a so-called phase vocoder for time stretching (in MATLAB).
Features
- A comprehensive overview of contemporary speech and audio processing techniques from perceptual and physical acoustic models to a thorough background in relevant digital signal processing techniques together with an exploration of speech and audio applications.
- A carefully paced progression of complexity of the described methods; building, in many cases, from first principles.
- Speech and wideband audio coding together with a description of associated standardised codecs (e.g. MP3, AAC and GSM).
- Speech recognition: Feature extraction (e.g. MFCC features), Hidden Markov Models (HMMs) and deep learning techniques such as Long Short-Time Memory (LSTM) methods.
- Book and computer-based problems at the end of each chapter.
- Contains numerous real-world examples backed up by many MATLAB functions and code.
商品描述(中文翻譯)
語音和音頻處理在過去幾十年中經歷了一場革命,並在最近幾年加速發展,產生了改變遊戲規則的技術,例如真正成功的語音識別系統;這一目標直到最近才變得可及。本書為讀者提供了當代語音和音頻處理技術的全面概述,重點在於使用 MATLAB 代碼的實際實現和示例。首先介紹核心概念,涵蓋音頻和振動的物理學,以及它們使用複數、Z 變換和頻率分析變換(如 FFT)的表示。
後面的章節描述了人類聽覺系統和心理聲學的基本原理。這些章節中提供的見解、結果和分析隨後用作理解本書中間部分的基礎,涵蓋:寬頻音頻壓縮(如 MP3 音頻等)、語音識別和語音編碼。
最後一章涵蓋音樂合成和應用,描述了 AM、FM 和環形調變技術等方法(並提供 MATLAB 示例)。本章最後給出了一個使用時間-頻率修改來實現所謂的相位聲碼器進行時間拉伸的示例(在 MATLAB 中)。
特點
- 從感知和物理聲學模型到相關數字信號處理技術的全面背景,並探索語音和音頻應用的當代語音和音頻處理技術的全面概述。
- 描述方法的複雜性進展經過精心安排;在許多情況下,從基本原理開始構建。
- 語音和寬頻音頻編碼,以及相關標準編解碼器的描述(例如 MP3、AAC 和 GSM)。
- 語音識別:特徵提取(例如 MFCC 特徵)、隱馬爾可夫模型(HMMs)和深度學習技術,如長短期記憶(LSTM)方法。
- 每章結尾都有書本和計算機基礎的問題。
- 包含眾多現實世界的例子,並附有許多 MATLAB 函數和代碼。
作者簡介
Dr Paul Hill received his B.Sc degree from the Open University (1996), an M.Sc degree from the University of Bristol, Bristol, U.K. (1998) and a Ph.D. also from the University of Bristol (2002). His research interests include image and video analysis, compression, fusion and multiscale transforms together with audio applications such as compression, retrieval and signal separation. He is currently a senior research fellow at the Department of Electrical and Electronic Engineering at the University of Bristol. He has taught the speech and audio processing course that the university for over 8 years and has supervised numerous audio MSc projects over that time. He has published over 30 academic papers and is also an amateur musician and composer often reflecting his passion for electronic music in his lectures and presentations.
作者簡介(中文翻譯)
保羅·希爾博士(Dr. Paul Hill)於1996年獲得開放大學(Open University)的理學士學位,1998年獲得英國布里斯托大學(University of Bristol)的理學碩士學位,並於2002年獲得布里斯托大學的博士學位。他的研究興趣包括影像與視頻分析、壓縮、融合及多尺度變換,並涉及音頻應用,如壓縮、檢索和信號分離。目前,他是布里斯托大學電氣與電子工程系的高級研究員。他在該大學教授語音與音頻處理課程已超過8年,並在此期間指導了多個音頻碩士項目。他已發表超過30篇學術論文,並且是一位業餘音樂家和作曲家,經常在他的講座和演示中反映他對電子音樂的熱情。