A Sound Source Separation Method for Multiple Person Speech Recognition using Wavelet Analysis Based on Sound Source Position Obtained by Depth Sensor

Presentation	2017-12-22 A Sound Source Separation Method for Multiple Person Speech Recognition using Wavelet Analysis Based on Sound Source Position Obtained by Depth Sensor Nobuhiro Uehara, Kazuo Ikeshiro, Hiroki Imamura
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Voice information guidance systems, such as those in operation at city halls, currently serve only one person at a time. To serve multiple people simultaneously, we propose a sound source separation method that uses wavelet analysis based on user positions obtained by a depth sensor. Evaluation experiments showed that the recognition accuracy of the proposed method was equal to or higher than that of a conventional method.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Speech Recognition / Sound Source Separation / Noise Reduction / Depth Image / Sparse Modeling
Paper #	SP2017-63
Date of Issue	2017-12-14 (SP)

Conference Information
Committee	NLC / IPSJ-NL / SP / IPSJ-SLP
Conference Date	2017/12/20 (3 days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Waseda Univ. Green Computing Systems Research Organization
Topics (in Japanese)	(See Japanese page)
Topics (in English)	The 4th Natural Language Processing Symposium & The 19th Spoken Language Symposium
Chair	Hiroshi Kanayama(IBM) / Kentaro Inui(Tohoku Univ.) / Yoichi Yamashita(Ritsumeikan Univ.) / Nobuaki Minematsu(Univ. Tokyo)
Vice Chair	Takeshi Sakaki(Hottolink) / Kazutaka Shimada(Kyushu Inst. of Tech.) / / Hiroki Mori(Utsunomiya Univ.)
Secretary	Takeshi Sakaki(Ryukoku Univ.) / Kazutaka Shimada(NTT) / (Osaka Univ.) / Hiroki Mori(Tokyo Inst. of Tech.) / (Mixi Co. Ltd.)
Assistant	Mitsuo Yoshida(Toyohashi Univ. of Tech.) / Takeshi Kobayakawa(NICT) / / Kei Hashimoto(Nagoya Inst. of Tech.) / Satoshi Kobashikawa(NTT)

Paper Information
Registration To	Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	A Sound Source Separation Method for Multiple Person Speech Recognition using Wavelet Analysis Based on Sound Source Position Obtained by Depth Sensor
Sub Title (in English)
Keyword(1)	Speech Recognition
Keyword(2)	Sound Source Separation
Keyword(3)	Noise Reduction
Keyword(4)	Depth Image
Keyword(5)	Sparse Modeling
1st Author's Name	Nobuhiro Uehara
1st Author's Affiliation	Soka University(Soka Univ.)
2nd Author's Name	Kazuo Ikeshiro
2nd Author's Affiliation	Soka University(Soka Univ.)
3rd Author's Name	Hiroki Imamura
3rd Author's Affiliation	Soka University(Soka Univ.)
Date	2017-12-22
Paper #	SP2017-63
Volume (vol)	vol.117
Number (no)	SP-368
Page	pp.79-83(SP)
#Pages	5
Date of Issue	2017-12-14 (SP)