交互学習と多層化によるニューラルネットワークビームフォーマの音声歪み低減の検討

江口 航平; 水町 光徳

Presentation	2018-11-21 Improvement of speech distortion caused by multi-layered neural network-based beamformer with alternative training Kouhei Eguchi, Mitsunori Mizumachi,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	A neural network-based beamformer can simultaneously optimize its beam-pattern at the desired and undesired directions so that it enables noise reduction more efficiently compared with a conventional delay-and-sum beamformer with the linear signal processing. In general, the neural network-based beamformer causes the annoying non-linear distortion on the output signal due to the non-linear activation functions. The authors previously proposed a distortion-less neural network-based beamformer with the dual cost functions based on the directivity and spectral distortion. The beamformer was alternatively trained with either the directivity-based or spectral distortion-based cost functions. The proposed beamformer was slightly superior to the conventional non-linear beamformer in the viewpoints of the signal-to-noise ration and spectral distortion. In this paper, the number of the network layers is increased from three to four and five, and the training scheme is reconsidered in training the neural network. It is found that the neural network-based beamformer with five layers achieves the highest performance. Concerning the training scheme, it is necessary to carefully investigate the relationship between the training order with the dual cost functions and the number of iteration.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Alternative training / Beamformer / Neural network / Non-linear distortion
Paper #	EA2018-64,EMM2018-64
Date of Issue	2018-11-14 (EA, EMM)

Conference Information
Committee	EA / ASJ-H / EMM / IPSJ-MUS
Conference Date	2018/11/21(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Hotel Koshuen
Topics (in Japanese)	(See Japanese page)
Topics (in English)	[Beginners Session] Engineering/Electro Acoustics, Psychological and Physiological Acoustics, Music and Computer, Content Processing, Digital Watermarking, and Related Topics
Chair	Suehiro Shimauchi(Kanazawa Inst. of Tech.) / 小澤賢司(山梨大) / Keiichi Iwamura(TUC) / 吉井和佳(京大)
Vice Chair	Kenichi Furuya(Oita Univ.) / Kanji Watanabe(Akita Pref. Univ.) / 中川誠司(千葉大) / Minoru Kuribayashi(Okayama Univ.) / Tetsuya Kojima(NIT,Tokyo College)
Secretary	Kenichi Furuya(Shizuoka Inst. of Science and Tech.) / Kanji Watanabe(NHK) / 中川誠司(尚絅大) / Minoru Kuribayashi(北陸先端大) / Tetsuya Kojima(千葉大) / (NIT, Tokyo)
Assistant	Keisuke Imoto(Ritsumeikan Univ.) / Daisuke Morikawa(Toyama Pref Univ.) / / Hiroko Akiyama(NIT, Nagano College) / キタヒロカネダ(CANON)

Paper Information
Registration To	Technical Committee on Engineering Acoustics / Auditory Research Meeting / Technical Committee on Enriched MultiMedia / Special Interest Group on Music and Computer
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Improvement of speech distortion caused by multi-layered neural network-based beamformer with alternative training
Sub Title (in English)
Keyword(1)	Alternative training
Keyword(2)	Beamformer
Keyword(3)	Neural network
Keyword(4)	Non-linear distortion
1st Author's Name	Kouhei Eguchi
1st Author's Affiliation	Kyushu Institute of Technology(KIT)
2nd Author's Name	Mitsunori Mizumachi
2nd Author's Affiliation	Kyushu Institute of Technology(KIT)
Date	2018-11-21
Paper #	EA2018-64,EMM2018-64
Volume (vol)	vol.118
Number (no)	EA-312,EMM-313
Page	pp.pp.7-12(EA), pp.7-12(EMM),
#Pages	6
Date of Issue	2018-11-14 (EA, EMM)