Presentation | 2007/10/18 A fundamental study of structure-to-speech conversion Daisuke SAITO, Satoshi ASAKAWA, Nobuaki MINEMATSU, Keikichi HIROSE |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Speech acoustics vary with age, gender, vocal tract length, microphone, and other factors. The authors recently proposed a structural, abstract representation of speech in which these variations are effectively removed. This study proposes a speech synthesis framework based on that structural representation. In the proposed framework, a system requires the "speech gestalt" of an utterance, the vocal tract length properties of the speaker, and the transmission properties of the microphone. Using this information, the system generates the acoustic signals that listeners hear. The framework can be regarded as a model of infants' vocal imitation. As a fundamental investigation, this report treats the framework as a problem of searching the cepstrum space for solutions under some constraints. In experiments, speech samples with proper phonological characteristics were synthesized. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | structural representation / speaker invariant / vocal imitation / language acquisition / searching for solutions |
Paper # | SP2007-80 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2007/10/18 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A fundamental study of structure-to-speech conversion |
Sub Title (in English) | |
Keyword(1) | structural representation |
Keyword(2) | speaker invariant |
Keyword(3) | vocal imitation |
Keyword(4) | language acquisition |
Keyword(5) | searching for solutions |
1st Author's Name | Daisuke SAITO |
1st Author's Affiliation | Graduate School of Frontier Sciences, The University of Tokyo |
2nd Author's Name | Satoshi ASAKAWA |
2nd Author's Affiliation | Graduate School of Frontier Sciences, The University of Tokyo |
3rd Author's Name | Nobuaki MINEMATSU |
3rd Author's Affiliation | Graduate School of Frontier Sciences, The University of Tokyo |
4th Author's Name | Keikichi HIROSE |
4th Author's Affiliation | Graduate School of Information Science and Technology, The University of Tokyo |
Date | 2007/10/18 |
Paper # | SP2007-80 |
Volume (vol) | vol.107 |
Number (no) | 282 |
Page | pp.- |
#Pages | 6 |
Date of Issue |