Presentation | 2021-05-27 Investigation and Evaluation Experiment of Noise Removal for Voice Recognition in Specific Noisy Environment Shota Sano, Fumitaka Murakami, Yuusuke Kawakita, Tsuyoshi Miyazaki, Hiroshi Tanaka, |
---|---|
PDF Download Page | ![]() |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this manuscript, the noise removal performance and speech recognition accuracy is described when noise is removed by assuming the specific situation in order to improve speech recognition accuracy in a noisy environment such as a crowded spot or in a train. Noise removal was performed by using the SS and DAE method in the experiment. We created speech data with noise superimposed with 2 types of noise assuming crowded spot and inside a train, and 6 types of SN ratio of -10, -5, 0, 5, 10, 15 dB. In the DAE method, the noise was removed and compared by using the model created by mixing multiple noises, and learning models individually created by adding each noise with SN condition. The noise removal performance was evaluated by the cosine similarity to the time-series data, the similarity of the spectrogram image by the normalized correlation, and the speech recognition accuracy between speech data before noise superimposition and the noise removal. It was verified that the individual learning model gave better results than the results by the model created by mixing noise. Also it was confirmed that speech recognition was possible with an accuracy of about 80% only for the model individually created under the conditions of SN ratio of 10dB. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Deep Learning / Voice Recognition / Noise Removal / DAE / Spectral Subtraction |
Paper # | SeMI2021-2 |
Date of Issue | 2021-05-20 (SeMI) |
Conference Information | |
Committee | SeMI / IPSJ-MBL / IPSJ-DPS / IPSJ-ITS |
---|---|
Conference Date | 2021/5/27(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Susumu Ishihara(Shizuoka Univ.) |
Vice Chair | Kazuya Monden(Hitachi) / Koji Yamamoto(Kyoto Univ.) |
Secretary | Kazuya Monden(Kyoto Univ.) / Koji Yamamoto(Cyber Univ.) / (Hitachi) / (Waseda Univ.) |
Assistant | Yuki Katsumata(NTT DOCOMO) / Yu Nakayama(Tokyo Univ. of Agri. and Tech.) / Akira Uchiyama(Osaka Univ.) |
Paper Information | |
Registration To | Technical Committee on Sensor Network and Mobile Intelligence / Special Interest Group on Mobile Computing and Pervasive Systems / Special Interest Group on Distributed Processing System / Special Interest Group on Intelligent Transport Systems and Smart Community |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Investigation and Evaluation Experiment of Noise Removal for Voice Recognition in Specific Noisy Environment |
Sub Title (in English) | |
Keyword(1) | Deep Learning |
Keyword(2) | Voice Recognition |
Keyword(3) | Noise Removal |
Keyword(4) | DAE |
Keyword(5) | Spectral Subtraction |
1st Author's Name | Shota Sano |
1st Author's Affiliation | Kanagawa Institute of Technology(KAIT) |
2nd Author's Name | Fumitaka Murakami |
2nd Author's Affiliation | Kanagawa Institute of Technology(KAIT) |
3rd Author's Name | Yuusuke Kawakita |
3rd Author's Affiliation | Kanagawa Institute of Technology(KAIT) |
4th Author's Name | Tsuyoshi Miyazaki |
4th Author's Affiliation | Kanagawa Institute of Technology(KAIT) |
5th Author's Name | Hiroshi Tanaka |
5th Author's Affiliation | Kanagawa Institute of Technology(KAIT) |
Date | 2021-05-27 |
Paper # | SeMI2021-2 |
Volume (vol) | vol.121 |
Number (no) | SeMI-41 |
Page | pp.pp.5-10(SeMI), |
#Pages | 6 |
Date of Issue | 2021-05-20 (SeMI) |