Investigation and Evaluation Experiment of Noise Removal for Voice Recognition in Specific Noisy Environment

Shota Sano; Fumitaka Murakami; Yuusuke Kawakita; Tsuyoshi Miyazaki; Hiroshi Tanaka

Presentation	2021-05-27 Investigation and Evaluation Experiment of Noise Removal for Voice Recognition in Specific Noisy Environment Shota Sano, Fumitaka Murakami, Yuusuke Kawakita, Tsuyoshi Miyazaki, Hiroshi Tanaka
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This manuscript describes noise removal performance and speech recognition accuracy when noise is removed under an assumed specific situation, with the aim of improving speech recognition accuracy in noisy environments such as crowded places or trains. In the experiment, noise was removed using the spectral subtraction (SS) method and the DAE method. We created noisy speech data by superimposing two types of noise, one assuming a crowded place and one assuming the inside of a train, at six SN ratios: -10, -5, 0, 5, 10, and 15 dB. For the DAE method, we compared noise removal by a model trained on a mixture of multiple noises with noise removal by models trained individually for each noise type and SN condition. Noise removal performance was evaluated by comparing the speech data before noise superimposition with the data after noise removal, using three measures: the cosine similarity of the time-series data, the similarity of the spectrogram images by normalized correlation, and speech recognition accuracy. The individually trained models gave better results than the model trained on mixed noise. Speech recognition with an accuracy of about 80% was confirmed only for the individually trained model under the 10 dB SN ratio condition.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Deep Learning / Voice Recognition / Noise Removal / DAE / Spectral Subtraction
Paper #	SeMI2021-2
Date of Issue	2021-05-20 (SeMI)
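
The data preparation and two of the similarity measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the synthetic sine "speech" and Gaussian "noise", and the zero-mean form of normalized correlation are all assumptions made for illustration, and the record does not say which variant of normalized correlation the paper uses. The sketch scales a noise signal so that the mixture has a requested SN ratio, then scores a signal against the clean reference.

```python
import math
import random


def power(signal):
    """Mean squared amplitude of a 1-D signal."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech after scaling the noise to the requested SN ratio.

    SNR(dB) = 10 * log10(P_speech / P_noise), so the target noise power is
    P_speech / 10^(snr_db / 10).
    """
    target_noise_power = power(speech) / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / power(noise))
    return [s + gain * n for s, n in zip(speech, noise)]


def measured_snr_db(speech, noisy):
    """SN ratio implied by a clean signal and its noisy version."""
    residual = [y - s for s, y in zip(speech, noisy)]
    return 10 * math.log10(power(speech) / power(residual))


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length time series."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def normalized_correlation(img_a, img_b):
    """Zero-mean normalized correlation between two equal-size 2-D arrays.

    Assumed variant; undefined when either input is constant.
    """
    flat_a = [v for row in img_a for v in row]
    flat_b = [v for row in img_b for v in row]
    mean_a = sum(flat_a) / len(flat_a)
    mean_b = sum(flat_b) / len(flat_b)
    return cosine_similarity([v - mean_a for v in flat_a],
                             [v - mean_b for v in flat_b])


if __name__ == "__main__":
    random.seed(0)
    # Hypothetical stand-ins for recorded speech and environmental noise.
    speech = [math.sin(2 * math.pi * 5 * t / 200) for t in range(200)]
    noise = [random.gauss(0.0, 1.0) for _ in range(200)]
    for snr in (-10, -5, 0, 5, 10, 15):  # the six SN ratios in the abstract
        noisy = mix_at_snr(speech, noise, snr)
        print(snr, round(cosine_similarity(speech, noisy), 3))
```

In an evaluation like the one described, the second argument to `cosine_similarity` would be the denoised output rather than the noisy mixture, and `normalized_correlation` would be applied to spectrogram magnitudes computed from the two signals.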

Conference Information
Committee	SeMI / IPSJ-MBL / IPSJ-DPS / IPSJ-ITS
Conference Date	2021/5/27 (2 days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Online
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Susumu Ishihara(Shizuoka Univ.)
Vice Chair	Kazuya Monden(Hitachi) / Koji Yamamoto(Kyoto Univ.)
Secretary	Kazuya Monden(Kyoto Univ.) / Koji Yamamoto(Cyber Univ.) / (Hitachi) / (Waseda Univ.)
Assistant	Yuki Katsumata(NTT DOCOMO) / Yu Nakayama(Tokyo Univ. of Agri. and Tech.) / Akira Uchiyama(Osaka Univ.)

Paper Information
Registration To	Technical Committee on Sensor Network and Mobile Intelligence / Special Interest Group on Mobile Computing and Pervasive Systems / Special Interest Group on Distributed Processing System / Special Interest Group on Intelligent Transport Systems and Smart Community
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Investigation and Evaluation Experiment of Noise Removal for Voice Recognition in Specific Noisy Environment
Sub Title (in English)
Keyword(1)	Deep Learning
Keyword(2)	Voice Recognition
Keyword(3)	Noise Removal
Keyword(4)	DAE
Keyword(5)	Spectral Subtraction
1st Author's Name	Shota Sano
1st Author's Affiliation	Kanagawa Institute of Technology(KAIT)
2nd Author's Name	Fumitaka Murakami
2nd Author's Affiliation	Kanagawa Institute of Technology(KAIT)
3rd Author's Name	Yuusuke Kawakita
3rd Author's Affiliation	Kanagawa Institute of Technology(KAIT)
4th Author's Name	Tsuyoshi Miyazaki
4th Author's Affiliation	Kanagawa Institute of Technology(KAIT)
5th Author's Name	Hiroshi Tanaka
5th Author's Affiliation	Kanagawa Institute of Technology(KAIT)
Date	2021-05-27
Paper #	SeMI2021-2
Volume (vol)	vol.121
Number (no)	SeMI-41
Page	pp.5-10(SeMI)
#Pages	6
Date of Issue	2021-05-20 (SeMI)