Presentation | 2004/12/13 Design and Implementation of HMM/BN Acoustic Models Konstantin MARKOV, Satoshi NAKAMURA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In recent years, the number of studies investigating new directions in speech modeling that goes beyond the conventional HMM has increased considerably. One promising approach is to use Bayesian Networks (BN) as speech model. Full recognition systems based on Dynamic BN as well as acoustic models using BN have been proposed lately. Our group at ATR has been developing the hybrid HMM/BN model which is a HMM where the state probability distribution is modeled by a BN, instead of commonly used mixture of Gaussian functions. In this paper, we describe the hybrid HMM/BN acoustic modeling framework especially emphasizing on some model design and implementation issues. The HMM/BN training is based on the Viterbi training paradigm and consists of two alternating steps - BN training and HMM transitions update. For recognition, in some cases, BN inference is computationally equivalent to mixture of Gaussians which allows HMM/BN model to be used in existing decoders without any modification. We present two examples of HMM/BN model application in speech recognition systems. Evaluations.under various conditions and for different tasks showed that the HMM/BN model gives consistently better performance than the standard mixture of Gaussians HMM. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | HMM/BN / acoustic model / Bayesian network |
Paper # | NLC2004-43,SP2004-83 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2004/12/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Design and Implementation of HMM/BN Acoustic Models |
Sub Title (in English) | |
Keyword(1) | HMM/BN |
Keyword(2) | acoustic model |
Keyword(3) | Bayesian network |
1st Author's Name | Konstantin MARKOV |
1st Author's Affiliation | Department of Acoustics and Speech Research, Spoken Language Translation Research Labs. ATR() |
2nd Author's Name | Satoshi NAKAMURA |
2nd Author's Affiliation | Department of Acoustics and Speech Research, Spoken Language Translation Research Labs. ATR |
Date | 2004/12/13 |
Paper # | NLC2004-43,SP2004-83 |
Volume (vol) | vol.104 |
Number (no) | 538 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |