Presentation | 2003/1/24 AN EVALUATION OF AUTOMATIC PHONEME SEGMENTATION FOR CONCATENATIVE SPEECH SYNTHESIS Hisashi KAWAI, Tomoki TODA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper studies the performance of automatic phoneme segmentation in three viewpoints : (1) temporal precision, (2) effects on segment selection, and (3) effects on the naturalness of synthetic speech. The absolute error of thc phoneme beginning time for the best 90% and worst 10% were respectively 4.6ms and 25.9ms, which are comparable to discrepancies among human labelers. Our segment selection algorithm was found to have ability to eliminate waveform segments with large temporal errors, although not perfectly. As the result of a perception test in which naturalness was pair-compared between synthetic speeches generated from hand-labeled data and auto-labeled data, it was found that the difference is marginal in practice although the latter is statistically inferior. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | phoneme segmentation / speech synthesis / corpus-base / segment selection |
Paper # | |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2003/1/24(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | AN EVALUATION OF AUTOMATIC PHONEME SEGMENTATION FOR CONCATENATIVE SPEECH SYNTHESIS |
Sub Title (in English) | |
Keyword(1) | phoneme segmentation |
Keyword(2) | speech synthesis |
Keyword(3) | corpus-base |
Keyword(4) | segment selection |
1st Author's Name | Hisashi KAWAI |
1st Author's Affiliation | Advanced Telecommunications Research Institute International() |
2nd Author's Name | Tomoki TODA |
2nd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
Date | 2003/1/24 |
Paper # | |
Volume (vol) | vol.102 |
Number (no) | 619 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |