［ポスター講演］雑音に頑強な話者認識のための基本周波数を用いた深層ニューラルネットワーク

Presentation	2016-12-20 [Poster Presentation] Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition Yoshihiro Suzuki, Yosuke Sugiura, Tetsuya Shimamura,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, we propose a neural network architecture for speaker recognition to simplify learning process. In the proposed method, we use not only the amplitude spectrum but also a harmonic binary vector generated from fundamental frequency as an input for the network. Hence, it is possible to keep recognition accuracy in noisy environment without using a noisy speech, and also possible to reduce both network size and computation time. In an experiment for 10 speakers, we could confirm a performance improvement in noisy environment relative to using only the amplitude spectrum.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Deep Learning / Neural Network / Speaker Recognition / Fundamental Frequency
Paper #	SP2016-58
Date of Issue	2016-12-13 (SP)

Conference Information
Committee	SP / IPSJ-SLP / NLC / IPSJ-NL
Conference Date	2016/12/20(3days)
Place (in Japanese)	(See Japanese page)
Place (in English)	NTT Musashino R&D
Topics (in Japanese)	(See Japanese page)
Topics (in English)	The 18th Spoken Language Symposium & The Third Natural Language Processing Symposium
Chair	Kazunori Mano(Shibaura Inst. of Tech.) / Nobuaki Minematsu(Univ. Tokyo) / Hiroshi Kanayama(IBM) / Kentaro Inui(Tohoku Univ.)
Vice Chair	Hiroki Mori(Utsunomiya Univ.) / / Makoto Ichise(NTT DoCoMo) / Takeshi Sakaki(Univ. of Tokyo/Hottolink)
Secretary	Hiroki Mori(Kobe Univ.) / (Shizuoka Univ.) / Makoto Ichise(Kyoyo Univ.) / Takeshi Sakaki(Toshiba) / (Tokyo Institute of Technology)
Assistant	Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / / Ryuichiro Higashinaka(NTT) / Mitsuo Yoshida(Toyohashi Univ. of Tech.)

Paper Information
Registration To	Technical Committee on Speech / Special Interest Group on Spoken Language Processing / Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	[Poster Presentation] Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition
Sub Title (in English)
Keyword(1)	Deep Learning
Keyword(2)	Neural Network
Keyword(3)	Speaker Recognition
Keyword(4)	Fundamental Frequency
1st Author's Name	Yoshihiro Suzuki
1st Author's Affiliation	Saitama University(Saitama Univ.)
2nd Author's Name	Yosuke Sugiura
2nd Author's Affiliation	Saitama University(Saitama Univ.)
3rd Author's Name	Tetsuya Shimamura
3rd Author's Affiliation	Saitama University(Saitama Univ.)
Date	2016-12-20
Paper #	SP2016-58
Volume (vol)	vol.116
Number (no)	SP-378
Page	pp.pp.53-56(SP),
#Pages	4
Date of Issue	2016-12-13 (SP)