A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network

Harunori Koike; Takashi Nose; Takahiro Shinozaki; Akinori Ito

Presentation	2015-10-15 A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network Harunori Koike, Takashi Nose, Takahiro Shinozaki, Akinori Ito
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, we propose a novel technique for converting the speech individuality of an arbitrary source (input) speaker. The proposed technique uses a neural network (NN) for many-to-one mapping, and the NN is trained with pairs of multiple source speakers and a target speaker. The speaker individuality of the input speech is converted by a spectral differential filter. In previous studies of voice conversion, a speaker-dependent approach was proposed in which parallel speech data of the source and target speakers are used to train the conversion model. There is also another approach in which the conversion model is trained by speaker adaptation with a small amount of the target speaker's speech. Recently, we proposed speaker-independent voice conversion that does not use the user's speech in the training step. The purpose of this study is to improve the naturalness of the converted speech in speaker-independent voice conversion. We directly convert the waveform of the input speaker using a filter whose parameters are obtained from the difference between the spectral features before and after feature mapping. An advantage is that direct waveform conversion alleviates the quality degradation caused by fundamental frequency extraction errors in the conventional technique. We also show that the naturalness is further improved by variance compensation with an affine transformation.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)
Paper #	SP2015-61
Date of Issue	2015-10-08 (SP)
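
The abstract above describes two numerical steps: taking the differential of the spectral features before and after feature mapping, and compensating the variance of the mapped features with an affine transformation. The following is a minimal sketch of those two steps only, not the authors' implementation. The function names, the per-dimension scale-and-shift form of the affine transformation, the toy feature values, and the order of the two steps are all assumptions made for illustration. The neural network mapping and the filter that applies the differential to the waveform are not shown.

```python
from statistics import mean, pstdev


def spectral_differential(source_frames, mapped_frames):
    """Per-frame, per-dimension difference: mapped features minus source features."""
    return [
        [m - s for s, m in zip(src, mapped)]
        for src, mapped in zip(source_frames, mapped_frames)
    ]


def affine_variance_compensation(frames, target_mean, target_std):
    """Scale and shift each feature dimension to match assumed target statistics.

    Assumption: the affine transformation is y' = a * (y - mu) + target_mean,
    with a = target_std / sigma computed per dimension over the utterance.
    """
    dims = list(zip(*frames))  # transpose to one sequence per dimension
    out_dims = []
    for values, t_mean, t_std in zip(dims, target_mean, target_std):
        mu = mean(values)
        sigma = pstdev(values)
        scale = t_std / sigma if sigma > 0 else 1.0  # avoid division by zero
        out_dims.append([scale * (v - mu) + t_mean for v in values])
    return [list(frame) for frame in zip(*out_dims)]  # transpose back


# Toy data (hypothetical): two frames of two-dimensional spectral features.
source = [[1.0, 2.0], [3.0, 4.0]]
mapped = [[1.5, 2.0], [2.5, 5.0]]  # stands in for the NN output

compensated = affine_variance_compensation(
    mapped, target_mean=[2.0, 3.0], target_std=[1.0, 2.0]
)
differential = spectral_differential(source, compensated)
```

In this sketch the variance compensation is applied to the mapped features before the differential is taken, so that the differential already reflects the compensated target statistics. That ordering is a design choice of the example rather than something the abstract states.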

Conference Information
Committee	SP
Conference Date	2015/10/15(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Kobe Univ.
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Speech interface, Synthesis, Dialogue, Application system, etc.
Chair	Kazunori Mano(Shibaura Inst. of Tech.)
Vice Chair	Norihide Kitaoka(Tokushima Univ.)
Secretary	Norihide Kitaoka(Tokyo City Univ.)
Assistant	Takashi Nose(Tohoku Univ.) / Taichi Asami(NTT)

Paper Information
Registration To	Technical Committee on Speech
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network
Sub Title (in English)
Keyword(1)
Keyword(2)
Keyword(3)
Keyword(4)
Keyword(5)
1st Author's Name	Harunori Koike
1st Author's Affiliation	Tohoku University(Tohoku Univ.)
2nd Author's Name	Takashi Nose
2nd Author's Affiliation	Tohoku University(Tohoku Univ.)
3rd Author's Name	Takahiro Shinozaki
3rd Author's Affiliation	Tokyo Institute of Technology(Tokyo Tech)
4th Author's Name	Akinori Ito
4th Author's Affiliation	Tohoku University(Tohoku Univ.)
Date	2015-10-15
Paper #	SP2015-61
Volume (vol)	vol.115
Number (no)	SP-253
Page	pp.13-18(SP)
#Pages	6
Date of Issue	2015-10-08 (SP)