Presentation 2009-01-30
Mixture of Probabilistic Linear Regressions for Voice Conversion
Yu QIAO, Daisuke SAITO, Nobuaki MINEMATSU,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper introduces a model of Mixture of Probabilistic Linear Regressions (MPLR) to learn a mapping function between two feature spaces. The MPLR consists of weighted combination of several probabilistic linear regressions, whose parameters are estimated by using matrix calculation. The mixture nature of MPLR allows it to model nonlinear transformation. T he formulation of MPLR is general and independent of the types of the density models used. Two well-known GMM-based mapping methods for voice conversion [1],[2] can be regarded as the special cases of MPLR. This unified view not only provides insights to the GMM-based mapping techniques, but also indicates methods to improve them. Compared to [1], our formulation of MPLR avoids solving complex linear equations and yields a faster estimation of the transform parameters. As for [2], the MPLR estimation provides a modified mapping function which overcomes an implicit problem in [2]'s mapping function. We carried out experiments to compare the MPLR-based methods with the traditional GMM-based methods [1],[2] on a voice conversion task. The experimental results show that the MPLR-based methods always have better performance in various parameter setups.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Space mapping / non-linear transform / mixture model / linear regression / voice conversion
Paper # SP2008-139
Date of Issue

Conference Information
Committee SP
Conference Date 2009/1/22(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Mixture of Probabilistic Linear Regressions for Voice Conversion
Sub Title (in English)
Keyword(1) Space mapping
Keyword(2) non-linear transform
Keyword(3) mixture model
Keyword(4) linear regression
Keyword(5) voice conversion
1st Author's Name Yu QIAO
1st Author's Affiliation Grad. School of Engineering, Univ. of Tokyo()
2nd Author's Name Daisuke SAITO
2nd Author's Affiliation Grad. School of Engineering, Univ. of Tokyo
3rd Author's Name Nobuaki MINEMATSU
3rd Author's Affiliation Grad. School of Engineering, Univ. of Tokyo
Date 2009-01-30
Paper # SP2008-139
Volume (vol) vol.108
Number (no) 422
Page pp.pp.-
#Pages 6
Date of Issue