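Two-channel input speech recognition using sparseness-based blind source separation (Acoustic Processing / Speaker Identification, 10th Spoken Language Symposium)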

Presentation	2008-12-09 Two-channel input speech recognition using sparseness-based blind source separation Kenta NISHIKI, Yousuke IZUMI, Shinji WATANABE, Takuya NISHIMOTO, Nobutaka ONO, Shigeki SAGAYAMA
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper discusses two-channel input speech recognition using sparseness-based blind source separation. The target speech is extracted from signals observed under diffuse noise (e.g., reverberation) by a source separation technique in which a time-frequency mask for speech separation is designed dynamically using the EM algorithm. Cepstral Mean Normalization is applied to reduce distortions that remain in, or are newly introduced into, the separated speech features. In a connected digit recognition task with multiple noise sources, the proposed method drastically improved word accuracy in both anechoic and reverberant environments. It outperformed conventional methods, especially in the reverberant environment.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	sparseness / 2-channel blind source separation / reverberation / speech recognition
Paper #	NLC2008-24,SP2008-79
Date of Issue

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Two-channel input speech recognition using sparseness-based blind source separation
Sub Title (in English)
Keyword(1)	sparseness
Keyword(2)	2-channel blind source separation
Keyword(3)	reverberation
Keyword(4)	speech recognition
1st Author's Name	Kenta NISHIKI
1st Author's Affiliation	Department of Information Physics and Computing, University of Tokyo
2nd Author's Name	Yousuke IZUMI
2nd Author's Affiliation	Department of Information Physics and Computing, University of Tokyo
3rd Author's Name	Shinji WATANABE
3rd Author's Affiliation	NTT Communication Science Laboratories
4th Author's Name	Takuya NISHIMOTO
4th Author's Affiliation	Department of Information Physics and Computing, University of Tokyo
5th Author's Name	Nobutaka ONO
5th Author's Affiliation	Department of Information Physics and Computing, University of Tokyo
6th Author's Name	Shigeki SAGAYAMA
6th Author's Affiliation	Department of Information Physics and Computing, University of Tokyo
Date	2008-12-09
Paper #	NLC2008-24,SP2008-79
Volume (vol)	vol.108
Number (no)	337
Page	pp.-
#Pages	6