Presentation 2003/5/22
Mixed Speech Recognition Using Microphonearray
Toshiyuki SEKIYA, Tetsuji OGAWA, Tetsunori KOBAYASHI
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Double-talk recognition under distant-microphone conditions is one of the serious problems in real-environment speech recognition. In this paper, we tackle this problem with microphone-array-based BSAS (Band-Selection-based Audio Segregation). In this approach, we prepare several different directivity characteristics using a microphone array and use the differences among the array outputs to extract the desired speech. We also use generalized harmonic analysis (GHA) instead of the FFT for spectral analysis to improve the performance of BSAS. These modifications yield segregation that sounds good to human listeners, but the quality is still insufficient for recognition because some spectral distortion occurs during segregation processing. To make recognition robust to this distortion, we apply MLLR-based acoustic model adaptation and retraining. Together, these efforts achieved 76.2% word accuracy at an SN ratio of 0 dB. This represents a 45% reduction in error relative to using array signal processing alone, and a 30% reduction relative to using array signal processing with BSAS.
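The error-reduction figures in the abstract can be turned into approximate baseline numbers. The sketch below is an inference, not a result reported in the record: it assumes that error rate is 100 minus word accuracy and that both reductions are relative reductions in that error rate. The function name `implied_baseline_error` is hypothetical.

```python
def implied_baseline_error(final_accuracy_pct, relative_reduction_pct):
    """Baseline error rate (%) implied by a final accuracy and a relative error reduction.

    Assumption (not stated in the abstract): error = 100 - word accuracy,
    and the reduction is relative to the baseline error.
    """
    final_error = 100.0 - final_accuracy_pct
    return final_error / (1.0 - relative_reduction_pct / 100.0)


final_accuracy = 76.2  # word accuracy at 0 dB SN ratio, from the abstract

# Reductions quoted in the abstract, keyed by the baseline they refer to.
for label, reduction in [("array only", 45.0), ("array + BSAS", 30.0)]:
    err = implied_baseline_error(final_accuracy, reduction)
    print(f"{label}: implied error {err:.1f}%, implied accuracy {100.0 - err:.1f}%")
```

Under these assumptions the baselines come out at roughly 57% word accuracy for array signal processing alone and roughly 66% with BSAS added; the paper itself should be consulted for the measured values.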
Keyword(in Japanese) (See Japanese page)
Keyword(in English) microphone array / band selection / GHA / MLLR
Paper # SP2003-22
Date of Issue

Conference Information
Committee SP
Conference Date 2003/5/22 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Mixed Speech Recognition Using Microphonearray
Sub Title (in English)
Keyword(1) microphone array
Keyword(2) band selection
Keyword(3) GHA
Keyword(4) MLLR
1st Author's Name Toshiyuki SEKIYA
1st Author's Affiliation School of Science and Engineering, Waseda University
2nd Author's Name Tetsuji OGAWA
2nd Author's Affiliation School of Science and Engineering, Waseda University
3rd Author's Name Tetsunori KOBAYASHI
3rd Author's Affiliation School of Science and Engineering, Waseda University
Date 2003/5/22
Paper # SP2003-22
Volume (vol) vol.103
Number (no) 93
Page pp.-
#Pages 6
Date of Issue