Presentation 2001/12/13
Noise Speech Recognition based on Robust Features and A Model-Based Noise Compensation evaluated on Aurora-2 Task
Kaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We evaluated several feature-based methods and one model-based method for robust speech recognition in noise. The evaluation was performed on the Aurora 2 task. We show that features can be more robust to additive noise after sub-band based spectral subtraction. We also report a robust feature set derived from the differential power spectrum (DPS), which is robust not only to additive noise but also to spectral coloration caused by channel effects. When a clean training set is available, we show that a model-based noise compensation method can effectively improve system robustness to noise. Over the test sets as a whole, the feature-based methods yield about 22% relative improvement in accuracy for the multi-condition training task, and the model-based method yields about 63% relative performance improvement when systems are trained on the clean training set.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Speech recognition / Noise / Robust speech recognition
Paper # NLC2001-56,SP2001-91
Date of Issue

Conference Information
Committee NLC
Conference Date 2001/12/13 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Noise Speech Recognition based on Robust Features and A Model-Based Noise Compensation evaluated on Aurora-2 Task
Sub Title (in English)
Keyword(1) Speech recognition
Keyword(2) Noise
Keyword(3) Robust speech recognition
1st Author's Name Kaisheng Yao
1st Author's Affiliation ATR Spoken Language Translation Research Laboratories
2nd Author's Name Jingdong Chen
2nd Author's Affiliation ATR Spoken Language Translation Research Laboratories
3rd Author's Name Kuldip K. Paliwal
3rd Author's Affiliation ATR Spoken Language Translation Research Laboratories; School of Microelectronic Engineering, Griffith University
4th Author's Name Satoshi Nakamura
4th Author's Affiliation ATR Spoken Language Translation Research Laboratories
Date 2001/12/13
Paper # NLC2001-56,SP2001-91
Volume (vol) vol.101
Number (no) 520
Page pp.-
#Pages 6
Date of Issue