Presentation 2013/6/6
Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise
Cassia Valentini-Botinhao, 山岸 順一 (Junichi Yamagishi), Simon King
Abstract (Japanese)
Abstract (English) This paper presents our entry to a speech-in-noise intelligibility enhancement evaluation: the Hurricane Challenge. The system consists of a text-to-speech voice manipulated through a combination of enhancement strategies, each of which is known to be individually successful: a perceptually-motivated spectral shaper based on the Glimpse Proportion measure, dynamic range compression, and adaptation to Lombard excitation and duration patterns. We achieved substantial intelligibility improvements relative to unmodified synthetic speech: 4.9 dB in competing-speaker noise and 4.1 dB in speech-shaped noise. An analysis conducted across this and two other similar evaluations shows that the spectral shaper and the compressor (both of which are loudness boosters) contribute most under higher SNR conditions, particularly for speech-shaped noise. Lombard-adapted changes to duration and excitation are more beneficial in lower SNR conditions and for competing-speaker noise.
Keywords (Japanese)
Keywords (English) Intelligibility of speech in noise / HMM-based speech synthesis / Lombard speech
Report number SP203-47,WIT2013-17
Publication date

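Technical committee information
Technical committee WIT
Meeting dates 2013/6/6 (one-day meeting)
Venue (Japanese)
Venue (English)
Theme (Japanese)
Theme (English)
Chair name (Japanese)
Chair name (English)
Vice chair name (Japanese)
Vice chair name (English)
Secretary name (Japanese)
Secretary name (English)
Assistant secretary name (Japanese)
Assistant secretary name (English)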

Paper details
Registered technical committee Well-being Information Technology (WIT)
Language of text ENG
Title (Japanese)
Subtitle (Japanese)
Title (English) Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise
Subtitle (Japanese)
Keyword (1) (Japanese/English) / Intelligibility of speech in noise
First author name (Japanese/English) / Cassia Valentini-Botinhao
First author affiliation (Japanese/English)
Centre for Speech Technology Research, University of Edinburgh
Second author name (Japanese/English) 山岸 順一 / Junichi Yamagishi
Second author affiliation (Japanese/English) 国立情報学研究所コンテンツ科学研究系
Centre for Speech Technology Research, University of Edinburgh:NII
Third author name (Japanese/English) / Simon King
Third author affiliation (Japanese/English)
Centre for Speech Technology Research, University of Edinburgh
Presentation date 2013/6/6
Report number SP203-47,WIT2013-17
Volume (vol) vol.113
Issue (no) 77
Page range pp.-
Number of pages 6
Publication date