HMM音声合成におけるコンテキストクラスタリング決定木を用いた話者適応の検討(合成, 韻律, 生成, 一般)

Presentation	2003/8/15 Speaker adaptation using context clustering decision tree for HMM-based speech synthesis Junichi YAMAGISHI, Takashi MASUKO, Keiichi TOKUDA, Takao KOBAYASHl,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In order to synthesize speech with arbitrary individualities and/or emotional expressions, segment-based features have to be used as well as frame-based features. In this paper, to realize MLLR (Maximum Likelihood Liner Regression) based speaker adaptation reflecting those segment-based features for HMM-based speech synthesis, we propose a technique for applying context clustering decision trees constructed in a training stage to tying of regression matrices. Since a set of questions used for constructing context clustering decision trees contains questions related to segment-based features such as position and length, it is possible to incorporate segment-based features into the adaptation. We show that synthesized speech from the adapted model using the proposed technique can have segment-based features.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	HMM-based speech synthesis / Speaker adaptation / Maximum likelihood liner regression / Decision tree / Voice characteristics and prosodic features / Segment-based features
Paper #	SP2003-79
Date of Issue

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Speaker adaptation using context clustering decision tree for HMM-based speech synthesis
Sub Title (in English)
Keyword(1)	HMM-based speech synthesis
Keyword(2)	Speaker adaptation
Keyword(3)	Maximum likelihood liner regression
Keyword(4)	Decision tree
Keyword(5)	Voice characteristics and prosodic features
Keyword(6)	Segment-based features
1st Author's Name	Junichi YAMAGISHI
1st Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology()
2nd Author's Name	Takashi MASUKO
2nd Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
3rd Author's Name	Keiichi TOKUDA
3rd Author's Affiliation	Department of Computer Science, Nagoya Institute of Technology
4th Author's Name	Takao KOBAYASHl
4th Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
Date	2003/8/15
Paper #	SP2003-79
Volume (vol)	vol.103
Number (no)	264
Page	pp.pp.-
#Pages	6
Date of Issue