Presentation 2016-12-20
[Poster Presentation] Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition
Yoshihiro Suzuki, Yosuke Sugiura, Tetsuya Shimamura,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we propose a neural network architecture for speaker recognition to simplify learning process. In the proposed method, we use not only the amplitude spectrum but also a harmonic binary vector generated from fundamental frequency as an input for the network. Hence, it is possible to keep recognition accuracy in noisy environment without using a noisy speech, and also possible to reduce both network size and computation time. In an experiment for 10 speakers, we could confirm a performance improvement in noisy environment relative to using only the amplitude spectrum.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Deep Learning / Neural Network / Speaker Recognition / Fundamental Frequency
Paper # SP2016-58
Date of Issue 2016-12-13 (SP)

Conference Information
Committee SP / IPSJ-SLP / NLC / IPSJ-NL
Conference Date 2016/12/20(3days)
Place (in Japanese) (See Japanese page)
Place (in English) NTT Musashino R&D
Topics (in Japanese) (See Japanese page)
Topics (in English) The 18th Spoken Language Symposium & The Third Natural Language Processing Symposium
Chair Kazunori Mano(Shibaura Inst. of Tech.) / Nobuaki Minematsu(Univ. Tokyo) / Hiroshi Kanayama(IBM) / Kentaro Inui(Tohoku Univ.)
Vice Chair Hiroki Mori(Utsunomiya Univ.) / / Makoto Ichise(NTT DoCoMo) / Takeshi Sakaki(Univ. of Tokyo/Hottolink)
Secretary Hiroki Mori(Kobe Univ.) / (Shizuoka Univ.) / Makoto Ichise(Kyoyo Univ.) / Takeshi Sakaki(Toshiba) / (Tokyo Institute of Technology)
Assistant Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / / Ryuichiro Higashinaka(NTT) / Mitsuo Yoshida(Toyohashi Univ. of Tech.)

Paper Information
Registration To Technical Committee on Speech / Special Interest Group on Spoken Language Processing / Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) [Poster Presentation] Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition
Sub Title (in English)
Keyword(1) Deep Learning
Keyword(2) Neural Network
Keyword(3) Speaker Recognition
Keyword(4) Fundamental Frequency
1st Author's Name Yoshihiro Suzuki
1st Author's Affiliation Saitama University(Saitama Univ.)
2nd Author's Name Yosuke Sugiura
2nd Author's Affiliation Saitama University(Saitama Univ.)
3rd Author's Name Tetsuya Shimamura
3rd Author's Affiliation Saitama University(Saitama Univ.)
Date 2016-12-20
Paper # SP2016-58
Volume (vol) vol.116
Number (no) SP-378
Page pp.pp.53-56(SP),
#Pages 4
Date of Issue 2016-12-13 (SP)