Presentation | 2011-07-21 One-model Speech Recognition and Synthesis System based on Articulatory Masashi Kimura, Takayuki Onoda, Yurie Iribe, Kouichi Katsurada, Tsuneo Nitta |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper describes speech recognition (SR) and speech synthesis (SS) based on a single model of articulatory-movement HMMs shared by the SR module and the SS module. The SR module has an articulatory feature (AF) extractor with multi-layer neural networks (MLNs) that output an AF sequence to the HMMs. In the SS module, the speaker-invariant HMMs generate an AF sequence; a multi-layer neural network (MLN) then converts the AFs into vocal tract parameters, and an LSP (Line Spectrum Pairs) digital filter synthesizes the speech signal. A CELP coding technique is applied to improve sound quality when generating the voice source from codes embedded in the corresponding HMM states. The proposed speech synthesis system separates phonetic information from speaker individuality, so a target speaker's voice can be synthesized from a small amount of speech data. The experimental results show that the proposed system can produce good-quality speech from only two sentences. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Articulatory Features / One-model Speech Recognition and Synthesis / CELP codebook / LSP |
Paper # | SP2011-41 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2011/7/14 (1 day)
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | One-model Speech Recognition and Synthesis System based on Articulatory |
Sub Title (in English) | |
Keyword(1) | Articulatory Features |
Keyword(2) | One-model Speech Recognition and Synthesis |
Keyword(3) | CELP codebook |
Keyword(4) | LSP |
1st Author's Name | Masashi Kimura |
1st Author's Affiliation | Graduate school of Engineering, Toyohashi University of Technology
2nd Author's Name | Takayuki Onoda |
2nd Author's Affiliation | Graduate school of Engineering, Toyohashi University of Technology |
3rd Author's Name | Yurie Iribe |
3rd Author's Affiliation | Graduate school of Engineering, Toyohashi University of Technology |
4th Author's Name | Kouichi Katsurada |
4th Author's Affiliation | Graduate school of Engineering, Toyohashi University of Technology |
5th Author's Name | Tsuneo Nitta |
5th Author's Affiliation | Graduate school of Engineering, Toyohashi University of Technology |
Date | 2011-07-21 |
Paper # | SP2011-41 |
Volume (vol) | vol.111 |
Number (no) | 153 |
Page | pp.-
#Pages | 6 |