Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments

Presentation	2008-03-20 Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments Atsushi HANIU, Masashi UNOKI, Masato AKAGI,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper proposes the concept of our novel robust speech recognition method based on the selective sound segregation model, and demonstrates that the proposed method can play an effective role to improve robustness of automatic speech recognition (ASR) systems in various noisy environments. Almost all ASR systems for noise environments attempt to transform an input sound into a clean speech or reference patterns into ones adapted for noises using a noise model, and calculate similarity between an input sound and reference patterns. In our proposed method, the possibility of existence of a target speech in an input sound is employed as a measure of recognition. The possibility of existence of a target speech is calculated by validity of the selective sound segregation model without any noise model. An ASR system based on our proposed method was implemented. To evaluate our proposed ASR system, Japanese digits recognitions in various noisy environments were carried out using traditional ASR systems and the proposed one. Results showed that the proposed method is more robust than other in experimental conditions in SNR = 0 dB. These indicate the proposed method can play an effective role to improve robustness of the ASR systems.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Speech recognition / Noisy environment / Improving robustness / Bregman's regularities / Selective sound segregation
Paper #	SP2007-196
Date of Issue

Conference Information
Committee	SP
Conference Date	2008/3/13(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Speech (SP)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments
Sub Title (in English)
Keyword(1)	Speech recognition
Keyword(2)	Noisy environment
Keyword(3)	Improving robustness
Keyword(4)	Bregman's regularities
Keyword(5)	Selective sound segregation
1st Author's Name	Atsushi HANIU
1st Author's Affiliation	School of Information Science, Japan Advanced Institute of Science and Technology()
2nd Author's Name	Masashi UNOKI
2nd Author's Affiliation	School of Information Science, Japan Advanced Institute of Science and Technology
3rd Author's Name	Masato AKAGI
3rd Author's Affiliation	School of Information Science, Japan Advanced Institute of Science and Technology
Date	2008-03-20
Paper #	SP2007-196
Volume (vol)	vol.107
Number (no)	551
Page	pp.pp.-
#Pages	6
Date of Issue