HMMに基づく音声合成におけるスペクトル・ピッチ・状態継続長の同時モデル化

Presentation	1999/8/5 Simultaneous Modeling of Spectrum, Pitch and State Duration in HMM-based Speech Synthesis Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, we describe an HMM-based speech synthesis system in which spectrum, pitch and state duration are modeled simultaneously in a unified framework of HMM. In the system, spectrum, pitch and state duration are modeled by continuous density HMMs, multi-space probability distribution HMMs and multi-dimensional Gaussian distributions, respectively. The distributions for spectral parameter, pitch parameter and the state duration are clustered independently by using a decision-tree based context clustering technique. Synthetic speech is generated by using an speech parameter generation algorithm from HMM and a mel-cepstrum based vocoding technique. Through informal listening tests, we have confirmed that the proposed system successfully synthesizes natural-sounding speech which resembles the speaker in the training database.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	HMM / text-to-speech synthesis / mel-cepstrum / prosodic model / context clustering
Paper #	SP99-59
Date of Issue

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Simultaneous Modeling of Spectrum, Pitch and State Duration in HMM-based Speech Synthesis
Sub Title (in English)
Keyword(1)	HMM
Keyword(2)	text-to-speech synthesis
Keyword(3)	mel-cepstrum
Keyword(4)	prosodic model
Keyword(5)	context clustering
1st Author's Name	Takayoshi Yoshimura
1st Author's Affiliation	Department of Computer Science, Nagoya Inst. of Tech.()
2nd Author's Name	Keiichi Tokuda
2nd Author's Affiliation	Department of Computer Science, Nagoya Inst. of Tech.
3rd Author's Name	Takashi Masuko
3rd Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering,Tokyo Inst.of Tech.
4th Author's Name	Takao Kobayashi
4th Author's Affiliation	Interdisciplinary Graduate School of Science and Engineering,Tokyo Inst.of Tech.
5th Author's Name	Tadashi Kitamura
5th Author's Affiliation	Department of Computer Science, Nagoya Inst. of Tech.
Date	1999/8/5
Paper #	SP99-59
Volume (vol)	vol.99
Number (no)	255
Page	pp.pp.-
#Pages	6
Date of Issue