Noise-Robust Feature Extraction and Model Compensation for Speech Recognition in Noisy Environments on the AURORA-2 Task

Presentation	2001/12/13 Noise Speech Recognition based on Robust Features and A Model-Based Noise Compensation evaluated on Aurora-2 Task Kaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	We evaluated several feature-based methods and one model-based method for robust speech recognition in noise. The evaluation was performed on the Aurora 2 task. We show that features become more robust to additive noise after sub-band based spectral subtraction. We also report a robust feature set derived from the differential power spectrum (DPS), which is robust not only to additive noise but also to spectral coloration due to channel effects. When a clean training set is available, we show that a model-based noise compensation method can effectively improve system robustness to noise. Over the test sets as a whole, the feature-based methods yield about 22% relative improvement in accuracy for the multi-condition training task, and the model-based method yields about 63% relative performance improvement when systems are trained on the clean training set.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Speech recognition / Noise / Robust speech recognition
Paper #	NLC2001-56,SP2001-91
Date of Issue

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Noise Speech Recognition based on Robust Features and A Model-Based Noise Compensation evaluated on Aurora-2 Task
Sub Title (in English)
Keyword(1)	Speech recognition
Keyword(2)	Noise
Keyword(3)	Robust speech recognition
1st Author's Name	Kaisheng Yao
1st Author's Affiliation	ATR Spoken Language Translation Research Laboratories
2nd Author's Name	Jingdong Chen
2nd Author's Affiliation	ATR Spoken Language Translation Research Laboratories
3rd Author's Name	Kuldip K. Paliwal
3rd Author's Affiliation	ATR Spoken Language Translation Research Laboratories; School of Microelectronic Engineering, Griffith University
4th Author's Name	Satoshi Nakamura
4th Author's Affiliation	ATR Spoken Language Translation Research Laboratories
Date	2001/12/13
Paper #	NLC2001-56,SP2001-91
Volume (vol)	vol.101
Number (no)	520
Page	pp.-
#Pages	6
Date of Issue