Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit (Acoustic Models, 11th Spoken Language Symposium)

齋藤 大輔; 松浦 良; 峯松 信明; 広瀬 啓吉

Presentation	2009-12-21 Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit Daisuke SAITO, Ryo MATSUURA, Nobuaki MINEMATSU, Keikichi HIROSE
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Speech acoustics vary with age, gender, vocal tract length, microphone, and other factors. The authors recently proposed a structural and abstract representation of speech in which these variations are effectively removed. This representation captures only the dynamics of speech. In our previous studies, an ASR framework built on this abstract representation, which we call structure-based ASR, was proposed and examined. However, structure-based ASR has two problems: the curse of dimensionality and the large size of the modeling unit. To address these problems, this report proposes a new acoustic modeling method based on parameter sharing among statistical structure models and efficient reuse of the shared models. The proposed method considers edge vectors, which represent the speech contrast between any two acoustic events, and shares parameters by clustering in the parametric space of the edge vectors. To construct an acoustic model for a new word, the most likely edge model is selected and allocated to each edge vector in the structure of that word. Recognition experiments using continuous utterances of Japanese vowels show the validity of the proposed method.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	structural representation / invariant features / speech contrasts / clustering / acoustic modeling
Paper #	NLC2009-13,SP2009-77
Date of Issue

Conference Information
Committee	SP
Conference Date	2009/12/14 (1 day)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit
Sub Title (in English)
Keyword(1)	structural representation
Keyword(2)	invariant features
Keyword(3)	speech contrasts
Keyword(4)	clustering
Keyword(5)	acoustic modeling
1st Author's Name	Daisuke SAITO
1st Author's Affiliation	Graduate School of Engineering, The University of Tokyo
2nd Author's Name	Ryo MATSUURA
2nd Author's Affiliation	Graduate School of Frontier Sciences, The University of Tokyo
3rd Author's Name	Nobuaki MINEMATSU
3rd Author's Affiliation	Graduate School of Information Science and Technology, The University of Tokyo
4th Author's Name	Keikichi HIROSE
4th Author's Affiliation	Graduate School of Information Science and Technology, The University of Tokyo
Date	2009-12-21
Paper #	NLC2009-13,SP2009-77
Volume (vol)	vol.109
Number (no)	356
Page	pp.-
#Pages	6
Date of Issue