Presentation | 2016-09-05 Acoustic event detection and removal using LSTM-CTC for speech recognition Yu Nasu, Hiroshi Fujimura, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Deep learning techniques have drastically increased the speech recognition performance. However, there are few practical applications utilizing spontaneous speech recognition because of the difficulty compared to read speech recognition. Spontaneous speech is challenging to recognize due to various causes. In particular, acoustic events such as fillers and hesitation disfluencies can degrade speech recognition accuracy. In this paper, we propose a speech recognition system with simultaneous modeling of phonemes and acoustic events using an LSTM-CTC acoustic model, which efficiently detects or removes acoustic events in recognition outputs. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech recognition / acoustic modeling / LSTM / CTC / acoustic event detection |
Paper # | PRMU2016-69,IBISML2016-24 |
Date of Issue | 2016-08-29 (PRMU, IBISML) |
Conference Information | |
Committee | PRMU / IPSJ-CVIM / IBISML |
---|---|
Conference Date | 2016/9/5(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Eisaku Maeda(NTT) / / Kenji Fukumizu(ISM) |
Vice Chair | Seiichi Uchida(Kyushu Univ.) / Hironobu Fujiyoshi(Chubu Univ.) / / Masashi Sugiyama(Univ. of Tokyo) / Hisashi Kashima(Kyoto Univ.) |
Secretary | Seiichi Uchida(Kyoto Univ.) / Hironobu Fujiyoshi(NTT) / / Masashi Sugiyama(Univ. of Tokyo) / Hisashi Kashima(Nagoya Inst. of Tech.) |
Assistant | Masaki Oonishi(AIST) / Takuya Funatomi(NAIST) / / Toshihiro Kamishima(AIST) / Tomoharu Iwata(NTT) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding / Special Interest Group on Computer Vision and Image Media / Technical Committee on Infomation-Based Induction Sciences and Machine Learning |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Acoustic event detection and removal using LSTM-CTC for speech recognition |
Sub Title (in English) | |
Keyword(1) | speech recognition |
Keyword(2) | acoustic modeling |
Keyword(3) | LSTM |
Keyword(4) | CTC |
Keyword(5) | acoustic event detection |
1st Author's Name | Yu Nasu |
1st Author's Affiliation | former Corporate Research and Development Center, Toshiba Corporation(former Toshiba) |
2nd Author's Name | Hiroshi Fujimura |
2nd Author's Affiliation | Corporate Research and Development Center, Toshiba Corporation(Toshiba) |
Date | 2016-09-05 |
Paper # | PRMU2016-69,IBISML2016-24 |
Volume (vol) | vol.116 |
Number (no) | PRMU-208,IBISML-209 |
Page | pp.pp.121-126(PRMU), pp.121-126(IBISML), |
#Pages | 6 |
Date of Issue | 2016-08-29 (PRMU, IBISML) |