Presentation | 2008-03-20 Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments Atsushi HANIU, Masashi UNOKI, Masato AKAGI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper proposes the concept of our novel robust speech recognition method based on the selective sound segregation model, and demonstrates that the proposed method can play an effective role to improve robustness of automatic speech recognition (ASR) systems in various noisy environments. Almost all ASR systems for noise environments attempt to transform an input sound into a clean speech or reference patterns into ones adapted for noises using a noise model, and calculate similarity between an input sound and reference patterns. In our proposed method, the possibility of existence of a target speech in an input sound is employed as a measure of recognition. The possibility of existence of a target speech is calculated by validity of the selective sound segregation model without any noise model. An ASR system based on our proposed method was implemented. To evaluate our proposed ASR system, Japanese digits recognitions in various noisy environments were carried out using traditional ASR systems and the proposed one. Results showed that the proposed method is more robust than other in experimental conditions in SNR = 0 dB. These indicate the proposed method can play an effective role to improve robustness of the ASR systems. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Speech recognition / Noisy environment / Improving robustness / Bregman's regularities / Selective sound segregation |
Paper # | SP2007-196 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2008/3/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments |
Sub Title (in English) | |
Keyword(1) | Speech recognition |
Keyword(2) | Noisy environment |
Keyword(3) | Improving robustness |
Keyword(4) | Bregman's regularities |
Keyword(5) | Selective sound segregation |
1st Author's Name | Atsushi HANIU |
1st Author's Affiliation | School of Information Science, Japan Advanced Institute of Science and Technology() |
2nd Author's Name | Masashi UNOKI |
2nd Author's Affiliation | School of Information Science, Japan Advanced Institute of Science and Technology |
3rd Author's Name | Masato AKAGI |
3rd Author's Affiliation | School of Information Science, Japan Advanced Institute of Science and Technology |
Date | 2008-03-20 |
Paper # | SP2007-196 |
Volume (vol) | vol.107 |
Number (no) | 551 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |