Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

Presentation title	2013/6/6 Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise, Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King
Abstract (Japanese)
Abstract (English)	This paper presents our entry to the Hurricane Challenge, an evaluation of speech-in-noise intelligibility enhancement. The system consists of a text-to-speech voice manipulated through a combination of enhancement strategies, each known to be individually successful: a perceptually-motivated spectral shaper based on the Glimpse Proportion measure, dynamic range compression, and adaptation to Lombard excitation and duration patterns. We achieved substantial intelligibility improvements relative to unmodified synthetic speech: 4.9 dB in competing-speaker noise and 4.1 dB in speech-shaped noise. An analysis across this and two other similar evaluations shows that the spectral shaper and the compressor (both of which are loudness boosters) contribute most under higher SNR conditions, particularly for speech-shaped noise. Lombard-adapted changes to duration and excitation are more beneficial under lower SNR conditions and for competing-speaker noise.
Keywords (Japanese)
Keywords (English)	Intelligibility of speech in noise / HMM-based speech synthesis / Lombard speech
Report number	SP203-47,WIT2013-17
Publication date

Paper details
Sponsoring technical committee	Well-being Information Technology (WIT)
Language of text	ENG
Title (Japanese)
Subtitle (Japanese)
Title (English)	Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise
Subtitle (Japanese)
Keyword (1) (Japanese/English)	/ Intelligibility of speech in noise
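First author name (Japanese/English)	/ Cassia Valentini-Botinhao
First author affiliation (Japanese/English)	/ Centre for Speech Technology Research, University of Edinburgh
Second author name (Japanese/English)	山岸順一 / Junichi Yamagishi
Second author affiliation (Japanese/English)	National Institute of Informatics (NII), Digital Content and Media Sciences Research Division / Centre for Speech Technology Research, University of Edinburgh
Third author name (Japanese/English)	/ Simon King
Third author affiliation (Japanese/English)	/ Centre for Speech Technology Research, University of Edinburgh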
Presentation date	2013/6/6
Report number	SP203-47,WIT2013-17
Volume (vol)	vol.113
Issue (no)	77
Page range	pp.-
Number of pages	6
Publication date