Presentation 2016-01-14
Performance evaluation of CRF/HMM-based automatic accent labeling for speech synthesis
Rina Mashiko, Tomoki Koriyama, Takao Kobayashi,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We have proposed an accent type and phrase boundary estimation technique using acoustic and language models represented by HMM and CRF, respectively. It has been shown that naturalness of synthetic speech generated using HMM-based speech synthesis with the accent labels obtained by the proposed technique approaches that with manually annotated accent labels. In this paper, we investigate an optimal choice of weighting factor that determines the balance of acoustic and language models, and examine the effect of using speaker adaptation in the acoustic model training. Then we give a way of obtaining a better acoustic model for accent labeling that leads to more natural sounding synthetic speech. As a result of objective and subjective evaluation, we show that naturalness of synthetic speech can be improved by determining the weighting factor appropriately. Moreover, in the choice of the optimal weighting factor, it is shown that the use of a small amount of speech data of the target speaker would provide a better result.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) accent / speech synthesis / CRF / HMM
Paper # SP2015-85
Date of Issue 2016-01-07 (SP)

Conference Information
Committee SP
Conference Date 2016/1/14(1days)
Place (in Japanese) (See Japanese page)
Place (in English) Sunpian Kawasaki
Topics (in Japanese) (See Japanese page)
Topics (in English) Synthesis, Generation, Prosody, etc.
Chair Kazunori Mano(Shibaura Inst. of Tech.)
Vice Chair Norihide Kitaoka(Tokushima Univ.)
Secretary Norihide Kitaoka(Tokyo City Univ.)
Assistant Takashi Nose(Tohoku Univ.) / Taichi Asami(NTT)

Paper Information
Registration To Technical Committee on Speech
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Performance evaluation of CRF/HMM-based automatic accent labeling for speech synthesis
Sub Title (in English)
Keyword(1) accent
Keyword(2) speech synthesis
Keyword(3) CRF
Keyword(4) HMM
1st Author's Name Rina Mashiko
1st Author's Affiliation Tokyo Institute of Technology(Tokyo Tech)
2nd Author's Name Tomoki Koriyama
2nd Author's Affiliation Tokyo Institute of Technology(Tokyo Tech)
3rd Author's Name Takao Kobayashi
3rd Author's Affiliation Tokyo Institute of Technology(Tokyo Tech)
Date 2016-01-14
Paper # SP2015-85
Volume (vol) vol.115
Number (no) SP-392
Page pp.pp.1-6(SP),
#Pages 6
Date of Issue 2016-01-07 (SP)