Presentation 2017-03-02
[Poster Presentation] Study of branch selecting DNN acoustic model for robustness to environmental variation
Takafumi Moriya, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The performance of speech recognition tasks can be significantly improved by the use of deep neural networks (DNN). Speech recognition system is demanded to high recognition performance with the increase in use scene of itself. However, it needs to prepare acoustic models corresponding to each environment to obtain the best recognition results. Also, to train and make each acoustic model takes a lot of costs that contain preparation of a large amount of training data and computational time for training. The goal of this paper is to obtain an acoustic model that can adapt each environmental speech data and output high recognition results. We propose DNN architecture that is diverged and converged at input, and hidden or output layer respectively. The each pass of diverged DNN architecture is trained by using each environmental speech data, so it has a role for robustness to environmental variation. Compared to no diverged DNN architecture, our proposed DNN architecture improves character accuracy. Its relative error rate is 9.6%.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Speech Recognition / Acoustic Model / Noise Robustness / Deep Neural Network
Paper # EA2016-131,SIP2016-186,SP2016-126
Date of Issue 2017-02-22 (EA, SIP, SP)

Conference Information
Committee SP / SIP / EA
Conference Date 2017/3/1(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Industry Support Center
Topics (in Japanese) (See Japanese page)
Topics (in English) Speech, Engineering/Electro Acoustics, Signal Processing, and Related Topics
Chair Kazunori Mano(Shibaura Inst. of Tech.) / Makoto Nakashizuka(Chiba Inst. of Tech.) / Mitsunori Mizumachi(Kyushu Inst. of Tech.)
Vice Chair Hiroki Mori(Utsunomiya Univ.) / Masahiro Okuda(Univ. of Kitakyushu) / Shogo Muramatsu(Niigata Univ.) / Yoichi Haneda(Univ. of Electro-Comm.) / Suehiro Shimauchi(NTT)
Secretary Hiroki Mori(Kobe Univ.) / Masahiro Okuda(Shizuoka Univ.) / Shogo Muramatsu(Ritsumeikan Univ.) / Yoichi Haneda(Chiba Inst. of Tech.) / Suehiro Shimauchi(KDDI R&D Labs.)
Assistant Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / Osamu Watanabe(Takushoku Univ.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / TREVINO Jorge(Tohoku Univ.)

Paper Information
Registration To Technical Committee on Speech / Technical Committee on Signal Processing / Technical Committee on Engineering Acoustics
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) [Poster Presentation] Study of branch selecting DNN acoustic model for robustness to environmental variation
Sub Title (in English)
Keyword(1) Speech Recognition
Keyword(2) Acoustic Model
Keyword(3) Noise Robustness
Keyword(4) Deep Neural Network
1st Author's Name Takafumi Moriya
1st Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
2nd Author's Name Taichi Asami
2nd Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
3rd Author's Name Yoshikazu Yamaguchi
3rd Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
4th Author's Name Yushi Aono
4th Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
Date 2017-03-02
Paper # EA2016-131,SIP2016-186,SP2016-126
Volume (vol) vol.116
Number (no) EA-475,SIP-476,SP-477
Page pp.pp.277-282(EA), pp.277-282(SIP), pp.277-282(SP),
#Pages 6
Date of Issue 2017-02-22 (EA, SIP, SP)