Presentation 2013/6/6
Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise
Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King, Yannis Stylianou,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper presents our entry to a speech-in-noise intelligibility enhancement evaluation: the Hurricane Challenge. The system consists of a Text-To-Speech voice manipulated through a combination of enhancement strategies, each of which is known to be individually successful: a perceptually-motivated spectral shaper based on the Glimpse Proportion measure, dynamic range compression, and adaptation to Lombard excitation and duration patterns. We achieved substantial intelligibility improvements relative to unmodified synthetic speech: 4.9 dB in competing speaker and 4.1 dB in speech-shaped noise. An analysis conducted across this and other two similar evaluations shows that the spectral shaper and the compressor (both of which are loudness boosters) contribute most under higher SNR conditions, particularly for speech-shaped noise. Duration and excitation Lombard-adapted changes are more beneficial in lower SNR conditions, and for competing speaker noise.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) ntelligibility of speech in noise / HMM-based speech synthesis / Lombard speech
Paper # SP203-47,WIT2013-17
Date of Issue

Conference Information
Committee WIT
Conference Date 2013/6/6(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Well-being Information Technology(WIT)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise
Sub Title (in English)
Keyword(1) ntelligibility of speech in noise
Keyword(2) HMM-based speech synthesis
Keyword(3) Lombard speech
1st Author's Name Cassia Valentini-Botinhao
1st Author's Affiliation Centre for Speech Technology Research, University of Edinburgh()
2nd Author's Name Junichi Yamagishi
2nd Author's Affiliation Centre for Speech Technology Research, University of Edinburgh:NII
3rd Author's Name Simon King
3rd Author's Affiliation Centre for Speech Technology Research, University of Edinburgh
4th Author's Name Yannis Stylianou
4th Author's Affiliation Institute of Computer Science, Foundation of Research and Technology Hellas
Date 2013/6/6
Paper # SP203-47,WIT2013-17
Volume (vol) vol.113
Number (no) 77
Page pp.pp.-
#Pages 6
Date of Issue