HMM/BN音響モデルの設計と実装(国際ワークショップ

マルコフ コンスタンティン; 中村 哲

Presentation	2004/12/13 Design and Implementation of HMM/BN Acoustic Models Konstantin MARKOV, Satoshi NAKAMURA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In recent years, the number of studies investigating new directions in speech modeling that goes beyond the conventional HMM has increased considerably. One promising approach is to use Bayesian Networks (BN) as speech model. Full recognition systems based on Dynamic BN as well as acoustic models using BN have been proposed lately. Our group at ATR has been developing the hybrid HMM/BN model which is a HMM where the state probability distribution is modeled by a BN, instead of commonly used mixture of Gaussian functions. In this paper, we describe the hybrid HMM/BN acoustic modeling framework especially emphasizing on some model design and implementation issues. The HMM/BN training is based on the Viterbi training paradigm and consists of two alternating steps - BN training and HMM transitions update. For recognition, in some cases, BN inference is computationally equivalent to mixture of Gaussians which allows HMM/BN model to be used in existing decoders without any modification. We present two examples of HMM/BN model application in speech recognition systems. Evaluations.under various conditions and for different tasks showed that the HMM/BN model gives consistently better performance than the standard mixture of Gaussians HMM.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	HMM/BN / acoustic model / Bayesian network
Paper #	NLC2004-43,SP2004-83
Date of Issue

Conference Information
Committee	NLC
Conference Date	2004/12/13(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Design and Implementation of HMM/BN Acoustic Models
Sub Title (in English)
Keyword(1)	HMM/BN
Keyword(2)	acoustic model
Keyword(3)	Bayesian network
1st Author's Name	Konstantin MARKOV
1st Author's Affiliation	Department of Acoustics and Speech Research, Spoken Language Translation Research Labs. ATR()
2nd Author's Name	Satoshi NAKAMURA
2nd Author's Affiliation	Department of Acoustics and Speech Research, Spoken Language Translation Research Labs. ATR
Date	2004/12/13
Paper #	NLC2004-43,SP2004-83
Volume (vol)	vol.104
Number (no)	538
Page	pp.pp.-
#Pages	6
Date of Issue