Presentation 2001/9/21
Optimizing phase dispersion for excitation source in speech synthesis with STRAIGHT : Psychoacoustical evaluation and optimization of control parameters
Hideki IWASAWA, Minoru TSUZAKI, Hisashi KAWAI, Hideki KAWAHARA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The goal of the STRAIGHT analysis/synthesis system is to produce high quality synthetic speech which is perceptually indistinguishable from the original recording. In some cases the synthetic speech is highly natural, but in others excessive breathiness is remarkable. The main source of the breathiness seems to origin from lack of optimal control of group delay manipulation, through which phase dispersion for the high frequency region of excitation source pulses is realized. Here we report two psychoacoustic evaluation experiments, which showed individual differences of optimal group delay manipulation, as well as its overall tendencies, and what combination of parameter values mimic better(or worse)the original speech. A tentative method of parameter estimation based on physical measure of original speech is suggested.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) analysis/synthesis / psychoacoustic evaluation / voicing source / phase / group delay / voice quality variation
Paper # DSP2001-92,SP2001-65
Date of Issue

Conference Information
Committee SP
Conference Date 2001/9/21(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Optimizing phase dispersion for excitation source in speech synthesis with STRAIGHT : Psychoacoustical evaluation and optimization of control parameters
Sub Title (in English)
Keyword(1) analysis/synthesis
Keyword(2) psychoacoustic evaluation
Keyword(3) voicing source
Keyword(4) phase
Keyword(5) group delay
Keyword(6) voice quality variation
1st Author's Name Hideki IWASAWA
1st Author's Affiliation CREST, Japan Science and Technology Corporation()
2nd Author's Name Minoru TSUZAKI
2nd Author's Affiliation ATR Spoken Language Translation Research Laboratories
3rd Author's Name Hisashi KAWAI
3rd Author's Affiliation ATR Spoken Language Translation Research Laboratories
4th Author's Name Hideki KAWAHARA
4th Author's Affiliation Faculty of Systems Engineering, Wakayama University:ATR Information Sciences Division:CREST, Japan Science and Technology Corporation
Date 2001/9/21
Paper # DSP2001-92,SP2001-65
Volume (vol) vol.101
Number (no) 325
Page pp.pp.-
#Pages 7
Date of Issue