Presentation | 2017-03-02 [Poster Presentation] Study of branch selecting DNN acoustic model for robustness to environmental variation Takafumi Moriya, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | The performance of speech recognition tasks can be significantly improved by the use of deep neural networks (DNN). Speech recognition system is demanded to high recognition performance with the increase in use scene of itself. However, it needs to prepare acoustic models corresponding to each environment to obtain the best recognition results. Also, to train and make each acoustic model takes a lot of costs that contain preparation of a large amount of training data and computational time for training. The goal of this paper is to obtain an acoustic model that can adapt each environmental speech data and output high recognition results. We propose DNN architecture that is diverged and converged at input, and hidden or output layer respectively. The each pass of diverged DNN architecture is trained by using each environmental speech data, so it has a role for robustness to environmental variation. Compared to no diverged DNN architecture, our proposed DNN architecture improves character accuracy. Its relative error rate is 9.6%. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Speech Recognition / Acoustic Model / Noise Robustness / Deep Neural Network |
Paper # | EA2016-131,SIP2016-186,SP2016-126 |
Date of Issue | 2017-02-22 (EA, SIP, SP) |
Conference Information | |
Committee | SP / SIP / EA |
---|---|
Conference Date | 2017/3/1(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Okinawa Industry Support Center |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Speech, Engineering/Electro Acoustics, Signal Processing, and Related Topics |
Chair | Kazunori Mano(Shibaura Inst. of Tech.) / Makoto Nakashizuka(Chiba Inst. of Tech.) / Mitsunori Mizumachi(Kyushu Inst. of Tech.) |
Vice Chair | Hiroki Mori(Utsunomiya Univ.) / Masahiro Okuda(Univ. of Kitakyushu) / Shogo Muramatsu(Niigata Univ.) / Yoichi Haneda(Univ. of Electro-Comm.) / Suehiro Shimauchi(NTT) |
Secretary | Hiroki Mori(Kobe Univ.) / Masahiro Okuda(Shizuoka Univ.) / Shogo Muramatsu(Ritsumeikan Univ.) / Yoichi Haneda(Chiba Inst. of Tech.) / Suehiro Shimauchi(KDDI R&D Labs.) |
Assistant | Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / Osamu Watanabe(Takushoku Univ.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / TREVINO Jorge(Tohoku Univ.) |
Paper Information | |
Registration To | Technical Committee on Speech / Technical Committee on Signal Processing / Technical Committee on Engineering Acoustics |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Study of branch selecting DNN acoustic model for robustness to environmental variation |
Sub Title (in English) | |
Keyword(1) | Speech Recognition |
Keyword(2) | Acoustic Model |
Keyword(3) | Noise Robustness |
Keyword(4) | Deep Neural Network |
1st Author's Name | Takafumi Moriya |
1st Author's Affiliation | Nippon Telegraph and Telephone Corporation(NTT) |
2nd Author's Name | Taichi Asami |
2nd Author's Affiliation | Nippon Telegraph and Telephone Corporation(NTT) |
3rd Author's Name | Yoshikazu Yamaguchi |
3rd Author's Affiliation | Nippon Telegraph and Telephone Corporation(NTT) |
4th Author's Name | Yushi Aono |
4th Author's Affiliation | Nippon Telegraph and Telephone Corporation(NTT) |
Date | 2017-03-02 |
Paper # | EA2016-131,SIP2016-186,SP2016-126 |
Volume (vol) | vol.116 |
Number (no) | EA-475,SIP-476,SP-477 |
Page | pp.pp.277-282(EA), pp.277-282(SIP), pp.277-282(SP), |
#Pages | 6 |
Date of Issue | 2017-02-22 (EA, SIP, SP) |