Presentation | 2001/9/21 Optimizing phase dispersion for excitation source in speech synthesis with STRAIGHT : Psychoacoustical evaluation and optimization of control parameters Hideki IWASAWA, Minoru TSUZAKI, Hisashi KAWAI, Hideki KAWAHARA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | The goal of the STRAIGHT analysis/synthesis system is to produce high quality synthetic speech which is perceptually indistinguishable from the original recording. In some cases the synthetic speech is highly natural, but in others excessive breathiness is remarkable. The main source of the breathiness seems to origin from lack of optimal control of group delay manipulation, through which phase dispersion for the high frequency region of excitation source pulses is realized. Here we report two psychoacoustic evaluation experiments, which showed individual differences of optimal group delay manipulation, as well as its overall tendencies, and what combination of parameter values mimic better(or worse)the original speech. A tentative method of parameter estimation based on physical measure of original speech is suggested. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | analysis/synthesis / psychoacoustic evaluation / voicing source / phase / group delay / voice quality variation |
Paper # | DSP2001-92,SP2001-65 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2001/9/21(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Optimizing phase dispersion for excitation source in speech synthesis with STRAIGHT : Psychoacoustical evaluation and optimization of control parameters |
Sub Title (in English) | |
Keyword(1) | analysis/synthesis |
Keyword(2) | psychoacoustic evaluation |
Keyword(3) | voicing source |
Keyword(4) | phase |
Keyword(5) | group delay |
Keyword(6) | voice quality variation |
1st Author's Name | Hideki IWASAWA |
1st Author's Affiliation | CREST, Japan Science and Technology Corporation() |
2nd Author's Name | Minoru TSUZAKI |
2nd Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
3rd Author's Name | Hisashi KAWAI |
3rd Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
4th Author's Name | Hideki KAWAHARA |
4th Author's Affiliation | Faculty of Systems Engineering, Wakayama University:ATR Information Sciences Division:CREST, Japan Science and Technology Corporation |
Date | 2001/9/21 |
Paper # | DSP2001-92,SP2001-65 |
Volume (vol) | vol.101 |
Number (no) | 325 |
Page | pp.pp.- |
#Pages | 7 |
Date of Issue |