［ポスター講演］利用環境変動に頑健な分岐選択型DNN音響モデルの検討

森谷 崇史; 浅見 太一; 山口 義和; 青野 裕司

Presentation	2017-03-02 [Poster Presentation] Study of branch selecting DNN acoustic model for robustness to environmental variation Takafumi Moriya, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	The performance of speech recognition tasks can be significantly improved by the use of deep neural networks (DNN). Speech recognition system is demanded to high recognition performance with the increase in use scene of itself. However, it needs to prepare acoustic models corresponding to each environment to obtain the best recognition results. Also, to train and make each acoustic model takes a lot of costs that contain preparation of a large amount of training data and computational time for training. The goal of this paper is to obtain an acoustic model that can adapt each environmental speech data and output high recognition results. We propose DNN architecture that is diverged and converged at input, and hidden or output layer respectively. The each pass of diverged DNN architecture is trained by using each environmental speech data, so it has a role for robustness to environmental variation. Compared to no diverged DNN architecture, our proposed DNN architecture improves character accuracy. Its relative error rate is 9.6%.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Speech Recognition / Acoustic Model / Noise Robustness / Deep Neural Network
Paper #	EA2016-131,SIP2016-186,SP2016-126
Date of Issue	2017-02-22 (EA, SIP, SP)

Conference Information
Committee	SP / SIP / EA
Conference Date	2017/3/1(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Okinawa Industry Support Center
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Speech, Engineering/Electro Acoustics, Signal Processing, and Related Topics
Chair	Kazunori Mano(Shibaura Inst. of Tech.) / Makoto Nakashizuka(Chiba Inst. of Tech.) / Mitsunori Mizumachi(Kyushu Inst. of Tech.)
Vice Chair	Hiroki Mori(Utsunomiya Univ.) / Masahiro Okuda(Univ. of Kitakyushu) / Shogo Muramatsu(Niigata Univ.) / Yoichi Haneda(Univ. of Electro-Comm.) / Suehiro Shimauchi(NTT)
Secretary	Hiroki Mori(Kobe Univ.) / Masahiro Okuda(Shizuoka Univ.) / Shogo Muramatsu(Ritsumeikan Univ.) / Yoichi Haneda(Chiba Inst. of Tech.) / Suehiro Shimauchi(KDDI R&D Labs.)
Assistant	Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / Osamu Watanabe(Takushoku Univ.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / TREVINO Jorge(Tohoku Univ.)

Paper Information
Registration To	Technical Committee on Speech / Technical Committee on Signal Processing / Technical Committee on Engineering Acoustics
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	[Poster Presentation] Study of branch selecting DNN acoustic model for robustness to environmental variation
Sub Title (in English)
Keyword(1)	Speech Recognition
Keyword(2)	Acoustic Model
Keyword(3)	Noise Robustness
Keyword(4)	Deep Neural Network
1st Author's Name	Takafumi Moriya
1st Author's Affiliation	Nippon Telegraph and Telephone Corporation(NTT)
2nd Author's Name	Taichi Asami
2nd Author's Affiliation	Nippon Telegraph and Telephone Corporation(NTT)
3rd Author's Name	Yoshikazu Yamaguchi
3rd Author's Affiliation	Nippon Telegraph and Telephone Corporation(NTT)
4th Author's Name	Yushi Aono
4th Author's Affiliation	Nippon Telegraph and Telephone Corporation(NTT)
Date	2017-03-02
Paper #	EA2016-131,SIP2016-186,SP2016-126
Volume (vol)	vol.116
Number (no)	EA-475,SIP-476,SP-477
Page	pp.pp.277-282(EA), pp.277-282(SIP), pp.277-282(SP),
#Pages	6
Date of Issue	2017-02-22 (EA, SIP, SP)