Presentation | 2016-12-20 [Poster Presentation] Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition Yoshihiro Suzuki, Yosuke Sugiura, Tetsuya Shimamura, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we propose a neural network architecture for speaker recognition to simplify learning process. In the proposed method, we use not only the amplitude spectrum but also a harmonic binary vector generated from fundamental frequency as an input for the network. Hence, it is possible to keep recognition accuracy in noisy environment without using a noisy speech, and also possible to reduce both network size and computation time. In an experiment for 10 speakers, we could confirm a performance improvement in noisy environment relative to using only the amplitude spectrum. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Deep Learning / Neural Network / Speaker Recognition / Fundamental Frequency |
Paper # | SP2016-58 |
Date of Issue | 2016-12-13 (SP) |
Conference Information | |
Committee | SP / IPSJ-SLP / NLC / IPSJ-NL |
---|---|
Conference Date | 2016/12/20(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | NTT Musashino R&D |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | The 18th Spoken Language Symposium & The Third Natural Language Processing Symposium |
Chair | Kazunori Mano(Shibaura Inst. of Tech.) / Nobuaki Minematsu(Univ. Tokyo) / Hiroshi Kanayama(IBM) / Kentaro Inui(Tohoku Univ.) |
Vice Chair | Hiroki Mori(Utsunomiya Univ.) / / Makoto Ichise(NTT DoCoMo) / Takeshi Sakaki(Univ. of Tokyo/Hottolink) |
Secretary | Hiroki Mori(Kobe Univ.) / (Shizuoka Univ.) / Makoto Ichise(Kyoyo Univ.) / Takeshi Sakaki(Toshiba) / (Tokyo Institute of Technology) |
Assistant | Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / / Ryuichiro Higashinaka(NTT) / Mitsuo Yoshida(Toyohashi Univ. of Tech.) |
Paper Information | |
Registration To | Technical Committee on Speech / Special Interest Group on Spoken Language Processing / Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition |
Sub Title (in English) | |
Keyword(1) | Deep Learning |
Keyword(2) | Neural Network |
Keyword(3) | Speaker Recognition |
Keyword(4) | Fundamental Frequency |
1st Author's Name | Yoshihiro Suzuki |
1st Author's Affiliation | Saitama University(Saitama Univ.) |
2nd Author's Name | Yosuke Sugiura |
2nd Author's Affiliation | Saitama University(Saitama Univ.) |
3rd Author's Name | Tetsuya Shimamura |
3rd Author's Affiliation | Saitama University(Saitama Univ.) |
Date | 2016-12-20 |
Paper # | SP2016-58 |
Volume (vol) | vol.116 |
Number (no) | SP-378 |
Page | pp.pp.53-56(SP), |
#Pages | 4 |
Date of Issue | 2016-12-13 (SP) |