Quality of Synthetic Speech: Perceptual Dimensions, Influencing Factors, and Instrumental Assessment (T-Labs Series in Telecommunication Services)
暫譯: 合成語音的品質:感知維度、影響因素與工具評估(T-Labs 電信服務系列)

Florian Hinterleitner

  • 出版商: Springer
  • 出版日期: 2017-04-18
  • 售價: $4,890
  • 貴賓價: 9.5$4,646
  • 語言: 英文
  • 頁數: 157
  • 裝訂: Hardcover
  • ISBN: 9811037337
  • ISBN-13: 9789811037337
  • 相關分類: 通訊系統 Communication-systems
  • 海外代購書籍(需單獨結帳)

商品描述

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.

商品描述(中文翻譯)

本書回顧了合成語音的感知品質維度的研究,並將這些發現與當前技術水平進行比較,推導出五個通用的感知品質維度,適用於文本轉語音(TTS)信號。這五個維度為:(i) 語音的自然性,(ii) 韻律品質,(iii) 流暢性與可懂性,(iv) 無干擾,及 (v) 冷靜。此外,本書介紹了一個測試協議,用於在聆聽測試中有效識別這些維度。此外,還檢視了影響這些維度的幾個因素。最後,介紹、回顧並測試了不同的技術,用於對 TTS 信號的工具品質評估。最後,探討了將工具品質測量整合到串接式 TTS 系統中的要求。