Presentation 2009-12-21
Speaker Adaptation Using Nonlinear Spectral Transformation For Speech Recognition
Toyohiro HAYASHI, Yoshihiko NANKAKU, Akinobu LEE, Keiichi TOKUDA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper proposes a speaker adaptation technique using nonlinear spectral transform based on GMMs. One of the most popular forms of speaker adaptation is based on linear transforms, such as maximum likelihood linear regression(MLLR). In MLLR, model parameters of HMMs are linearly transformed based on the maximum likelihood(ML)fashion by using a small amount of adaptation data. Although multiple transform matrices are used according to the regression class information, only a single linear transform is applied to each state within a regression class. In the proposed technique, we define a new likelihood function combining HMMs for recognition with GMMs for spectral transform and speaker adaptation based on nonlinear transform is performed in the ML fashion. In phoneme recognition experiments, the proposed technique shows better performance than the conventional MLLR approaches.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Speech Recognition / Speaker Adaptation / Nonlinear Spectral Transformation
Paper # NLC2009-12,SP2009-76
Date of Issue

Conference Information
Committee NLC
Conference Date 2009/12/14(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Speaker Adaptation Using Nonlinear Spectral Transformation For Speech Recognition
Sub Title (in English)
Keyword(1) Speech Recognition
Keyword(2) Speaker Adaptation
Keyword(3) Nonlinear Spectral Transformation
1st Author's Name Toyohiro HAYASHI
1st Author's Affiliation Department of Computer Science and Engineering, Nagoya Institute of Technology()
2nd Author's Name Yoshihiko NANKAKU
2nd Author's Affiliation Department of Computer Science and Engineering, Nagoya Institute of Technology
3rd Author's Name Akinobu LEE
3rd Author's Affiliation Department of Computer Science and Engineering, Nagoya Institute of Technology
4th Author's Name Keiichi TOKUDA
4th Author's Affiliation Department of Computer Science and Engineering, Nagoya Institute of Technology
Date 2009-12-21
Paper # NLC2009-12,SP2009-76
Volume (vol) vol.109
Number (no) 355
Page pp.pp.-
#Pages 6
Date of Issue