Speech Coding: With Code-Excited Linear Prediction
Tentative Chinese title: 語音編碼:基於碼激勵線性預測
Bäckström, Tom
- Publisher: Springer
- Publication date: 2018-07-20
- List price: $5,380
- VIP price: 5% off, $5,111
- Language: English
- Pages: 240
- Binding: Quality Paper - also called trade paper
- ISBN: 3319843443
- ISBN-13: 9783319843445
Overseas special-order book (requires separate checkout)
Product description
This book provides a scientific understanding of the most central techniques used in speech coding, for advanced students as well as professionals with a background in speech, audio, and/or digital signal processing. It draws a clear connection between the whys, hows, and whats, so that the necessity, purpose, and solutions provided by each tool stay in sight, along with their strengths and weaknesses. Accordingly, the book examines each technology presented from the following perspectives:
Objective: What do we want to achieve and especially why is this goal important?
Resource / Information: What information is available and how can it be useful?
Resource / Platform: What kind of platforms are we working with and what are the capabilities/restrictions of those platforms? This includes properties such as computational, memory, acoustic and transmission capacity of devices used.
Solutions: Which solutions have been proposed and how can they be used to reach the stated goals?
Strengths and weaknesses: In which ways do the solutions fulfill the objectives and where are they insufficient? Are resources used efficiently?
This book concentrates solely on code-excited linear prediction and its derivatives, since mainstream speech codecs are based on linear prediction. It also concentrates exclusively on time-domain techniques, because frequency-domain tools are to a large extent shared with audio codecs.
About the author
Tom Bäckström is Professor of Speech Coding at the University of Erlangen-Nuremberg and a member of the International Audio Labs Erlangen, funded by Fraunhofer IIS. He is an active researcher in mathematical methods for modeling voice and audio. His interests lie in further developing the mathematics at the intersection of digital signal processing, matrix and polynomial algebra, and functional analysis.