Presentation | 2003/5/22 Mixed Speech Recognition Using Microphonearray Toshiyuki SEKIYA, Tetsuji OGAWA, Tetsunori KOBAYASHI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Double-talk recognition under distant microphone condition is one of the serious problems in real environment speech recognition. In this paper, this problem is solved by the microphone-array based BSAS (Band-Selection-based Audio Segregation). In this approach, we prepare some different directivity characteristics using a microphone array, and utilize the difference of these outputs of the array to extract desired speech. We also used generalized harmonic analysis (GHA) instead of FFT for the spectral analysis to improve the performance of BSAS. These modifications enable good segregation in a human auditory sense, but the quality is still insufficient for recognition because some spectral distortion occur in segregation processing. We used MLLR-based acoustic model adaptation and retraining to be robust to the spectral distortion. These efforts enabled 76.2% word accuracy under the condition that the SN ratio is 0 dB, this represents a 45% reduction in the error obtained in the case where only array signal processing was used, and a 30% error reduction compared with when array signal processing and BSAS were used. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | microphone array / band selection / GHA / MLLR |
Paper # | SP2003-22 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2003/5/22(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Mixed Speech Recognition Using Microphonearray |
Sub Title (in English) | |
Keyword(1) | microphone array |
Keyword(2) | band selection |
Keyword(3) | GHA |
Keyword(4) | MLLR |
1st Author's Name | Toshiyuki SEKIYA |
1st Author's Affiliation | School of Science and Engineering, Waseda University() |
2nd Author's Name | Tetsuji OGAWA |
2nd Author's Affiliation | School of Science and Engineering, Waseda University |
3rd Author's Name | Tetsunori KOBAYASHI |
3rd Author's Affiliation | School of Science and Engineering, Waseda University |
Date | 2003/5/22 |
Paper # | SP2003-22 |
Volume (vol) | vol.103 |
Number (no) | 93 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |