Presentation 2008-12-09
Two-channel input speech recognition using sparseness-based blind source separation
Kenta NISHIKI, Yousuke IZUMI, Shinji WATANABE, Takuya NISHIMOTO, Nobutaka ONO, Shigeki SAGAYAMA
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper discusses two-channel input speech recognition using sparseness-based blind source separation. The target speech is extracted from signals observed under diffuse noise (e.g., reverberation) by a source separation technique in which a time-frequency mask is designed dynamically using the EM algorithm. Cepstral Mean Normalization is applied to reduce distortions that remain in, or are newly introduced into, the separated speech features. In a connected digit recognition task with multiple noise sources, the proposed method drastically improved word accuracy in both anechoic and reverberant environments. Compared with conventional methods, it achieved higher performance, especially in the reverberant environment.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) sparseness / 2-channel blind source separation / reverberation / speech recognition
Paper # NLC2008-24,SP2008-79
Date of Issue

Conference Information
Committee NLC
Conference Date 2008/12/2 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Two-channel input speech recognition using sparseness-based blind source separation
Sub Title (in English)
Keyword(1) sparseness
Keyword(2) 2-channel blind source separation
Keyword(3) reverberation
Keyword(4) speech recognition
1st Author's Name Kenta NISHIKI
1st Author's Affiliation Department of Information Physics and Computing, University of Tokyo
2nd Author's Name Yousuke IZUMI
2nd Author's Affiliation Department of Information Physics and Computing, University of Tokyo
3rd Author's Name Shinji WATANABE
3rd Author's Affiliation NTT Communication Science Laboratories
4th Author's Name Takuya NISHIMOTO
4th Author's Affiliation Department of Information Physics and Computing, University of Tokyo
5th Author's Name Nobutaka ONO
5th Author's Affiliation Department of Information Physics and Computing, University of Tokyo
6th Author's Name Shigeki SAGAYAMA
6th Author's Affiliation Department of Information Physics and Computing, University of Tokyo
Date 2008-12-09
Paper # NLC2008-24,SP2008-79
Volume (vol) vol.108
Number (no) 337
Page pp.-
#Pages 6
Date of Issue