F0量子化と非パラレル学習に基づく声質変換の評価(テーマセッション,クロスモーダル)

Presentation	2010-01-21 Performance evaluation of Voice Conversion Based on F0 Quantization and Non-parallel Training Yuhei OTA, Takashi NOSE, Takao KOBAYASHI,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper describes the performance evaluation results of a context-dependent HMM-based voice conversion technique to show its effectiveness by comparing with a GMM-based one. In the HMM-based conversion, first we extract the phonetic and prosodic information from input speech of a source speaker. Then, converted synthetic speech is generated from the pre-trained acoustic model of a target speaker. To appropriately model the pitch information, we use a roughly quantized FO symbol sequence as the prosodic context instead of accent information obtained by manual labeling for training data. By using the phonetically and prosodically context-dependent HMMs, the speaker characteristics appearing in segmental and supra-segmental features can be also converted, which is difficult in conventional GMM-based techniques. Objective and subjective experimental results show that the naturalness and speaker individuality of converted speech are significantly improved by using HMM-based voice conversion.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	voice conversion / HMM-based speech synthesis / prosodic information / F0 quantization / GMM
Paper #	CQ2009-60,PRMU2009-159,SP2009-100,MVE2009-82
Date of Issue

Paper Information
Registration To	Communication Quality (CQ)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Performance evaluation of Voice Conversion Based on F0 Quantization and Non-parallel Training
Sub Title (in English)
Keyword(1)	voice conversion
Keyword(2)	HMM-based speech synthesis
Keyword(3)	prosodic information
Keyword(4)	F0 quantization
Keyword(5)	GMM
1st Author's Name	Yuhei OTA
1st Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology()
2nd Author's Name	Takashi NOSE
2nd Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
3rd Author's Name	Takao KOBAYASHI
3rd Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
Date	2010-01-21
Paper #	CQ2009-60,PRMU2009-159,SP2009-100,MVE2009-82
Volume (vol)	vol.109
Number (no)	373
Page	pp.pp.-
#Pages	6
Date of Issue