Presentation | 2014-12-16 Many-to-one Voice Conversion using Multiple Non-negative Matrix Factorization Ryo AIHARA, Tetsuya TAKIGUCHI, Yasuo ARIKI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Voice conversion (VC) is being widely researched in the field of speech processing because of increased interest in using such processing in applications such as personalized Text-To-Speech systems. Statistical approach using Gaussian Mixture Model (GMM) is widely researched in VC and eigen-voice GMM enables one-to-many and many-to-one VC from multiple training data sets. We present in this paper an exemplar-based VC method using Non-negative Matrix Factorization (NMF), which is different from conventional statistical VC. NMF-based VC has advantages of noise robustness and naturalness of converted voice compared to GMM-based VC. However, because NMF-based VC is based on parallel training data of source and target speaker, we cannot covert voice of arbitrary speakers in this framework. In this paper, we propose a many-to-one VC using Multiple Non-negative Matrix Factorization (Multi-NMF). By using Multi-NMF, arbitrary speaker's voice is converted to target speaker's voice without any training data of input speaker's. We assume that this method is flexible because we can adopt it to many-to-many VC or voice quality control. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Voice Conversion / Speech synthesis / Non-negative Matrix Factorization / Exemplar-based / Many-to-one |
Paper # | SP2014-114 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2014/12/8(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Many-to-one Voice Conversion using Multiple Non-negative Matrix Factorization |
Sub Title (in English) | |
Keyword(1) | Voice Conversion |
Keyword(2) | Speech synthesis |
Keyword(3) | Non-negative Matrix Factorization |
Keyword(4) | Exemplar-based |
Keyword(5) | Many-to-one |
1st Author's Name | Ryo AIHARA |
1st Author's Affiliation | Graduate School of System Informatics, Kobe University() |
2nd Author's Name | Tetsuya TAKIGUCHI |
2nd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
3rd Author's Name | Yasuo ARIKI |
3rd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
Date | 2014-12-16 |
Paper # | SP2014-114 |
Volume (vol) | vol.114 |
Number (no) | 365 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |