Presentation | 2014-06-19 Individuality-preserving Voice Conversion for Articulation Disorders Using Sparse Dictionary Learning Ryo AIHARA, Tetsuya TAKIGUCHI, Yasuo ARIKI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We present in this paper a voice conversion (VC) method for a person with an articulation disorder resulting from athetoid cerebral palsy. The movement of such speakers is limited by their athetoid symptoms, and their consonants are often unstable or unclear, which makes it difficult for them to communicate. In our previous method, exemplar-based spectral conversion using Non-negative Matrix Factorization (NMF) was applied to a voice with an articulation disorder. To preserve the speaker's individuality, we used a combined dictionary that is constructed from the source speaker's vowels and target speaker's consonants. However, in this exemplar-based approach, source speaker's activity matrix which is estimated from input spectra and source speaker's exemplars are used as target speaker's. In this paper, we propose a sparse dictionary learning method for exemplar-based VC and estimate a mapping matrix between source speaker's activity and target speaker's activity. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional Gaussian Mixture Model (GMM)-based method and a conventional NMF-based method. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Voice Conversion / Articulation Disorders / Asistive Technology / Non-negative Matrix Factorization |
Paper # | SP2014-53,WIT2014-8 |
Date of Issue |
Conference Information | |
Committee | WIT |
---|---|
Conference Date | 2014/6/12(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Well-being Information Technology(WIT) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Individuality-preserving Voice Conversion for Articulation Disorders Using Sparse Dictionary Learning |
Sub Title (in English) | |
Keyword(1) | Voice Conversion |
Keyword(2) | Articulation Disorders |
Keyword(3) | Asistive Technology |
Keyword(4) | Non-negative Matrix Factorization |
1st Author's Name | Ryo AIHARA |
1st Author's Affiliation | Graduate School of System Informatics, Kobe University() |
2nd Author's Name | Tetsuya TAKIGUCHI |
2nd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
3rd Author's Name | Yasuo ARIKI |
3rd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
Date | 2014-06-19 |
Paper # | SP2014-53,WIT2014-8 |
Volume (vol) | vol.114 |
Number (no) | 92 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |