Multiple Non-negative Matrix Factorizationを用いた多対一声質変換(ポスター・デモセッション,第16回音声言語シンポジウム)

相原 龍; 滝口 哲也; 有木 康雄

Presentation	2014-12-16 Many-to-one Voice Conversion using Multiple Non-negative Matrix Factorization Ryo AIHARA, Tetsuya TAKIGUCHI, Yasuo ARIKI,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Voice conversion (VC) is being widely researched in the field of speech processing because of increased interest in using such processing in applications such as personalized Text-To-Speech systems. Statistical approach using Gaussian Mixture Model (GMM) is widely researched in VC and eigen-voice GMM enables one-to-many and many-to-one VC from multiple training data sets. We present in this paper an exemplar-based VC method using Non-negative Matrix Factorization (NMF), which is different from conventional statistical VC. NMF-based VC has advantages of noise robustness and naturalness of converted voice compared to GMM-based VC. However, because NMF-based VC is based on parallel training data of source and target speaker, we cannot covert voice of arbitrary speakers in this framework. In this paper, we propose a many-to-one VC using Multiple Non-negative Matrix Factorization (Multi-NMF). By using Multi-NMF, arbitrary speaker's voice is converted to target speaker's voice without any training data of input speaker's. We assume that this method is flexible because we can adopt it to many-to-many VC or voice quality control.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Voice Conversion / Speech synthesis / Non-negative Matrix Factorization / Exemplar-based / Many-to-one
Paper #	SP2014-114
Date of Issue

Conference Information
Committee	SP
Conference Date	2014/12/8(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Many-to-one Voice Conversion using Multiple Non-negative Matrix Factorization
Sub Title (in English)
Keyword(1)	Voice Conversion
Keyword(2)	Speech synthesis
Keyword(3)	Non-negative Matrix Factorization
Keyword(4)	Exemplar-based
Keyword(5)	Many-to-one
1st Author's Name	Ryo AIHARA
1st Author's Affiliation	Graduate School of System Informatics, Kobe University()
2nd Author's Name	Tetsuya TAKIGUCHI
2nd Author's Affiliation	Organization of Advanced Science and Technology, Kobe University
3rd Author's Name	Yasuo ARIKI
3rd Author's Affiliation	Organization of Advanced Science and Technology, Kobe University
Date	2014-12-16
Paper #	SP2014-114
Volume (vol)	vol.114
Number (no)	365
Page	pp.pp.-
#Pages	6
Date of Issue