Presentation | 2014-12-16 Deep neural network-based feature transformation for reverberant speaker identification Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai, Weifeng Li, Masahiro Iwahashi, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Deep neural network has been shown to be effective in many automatic speech recognition. In this paper, we proposed both bottleneck-DNN (BF-DNN) and denoising-autoencoder (DAE) based feature transformation for reverberant speaker identification. For the BF-DNN, we consider that the DNNs can transform the speech frame to a new space with gerater discriminative classification ability for speaker identification. While the DAE dereverberation suppresses the reverberation can improve the performance of speaker identification. Since the BF-DNN and DAE have a great complementary nature, the linear likelihood combination of these two methods is expected be effective for this task. The evaluation experiment shows proposed method performed better than conventional method in reverberant environment. Our proposed method outperforms the multichannel least mean squares. The relative error reduction rate are 21.4% for BF-DNN and 47.0% for DAE, respectively. Moreover, the combination of this two methods further improved the performance. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Deep neural network / Bottleneck feature / Denoising autoencoder / Speaker identification / Dereverberation / Feature transformation |
Paper # | SP2014-119 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2014/12/8(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Deep neural network-based feature transformation for reverberant speaker identification |
Sub Title (in English) | |
Keyword(1) | Deep neural network |
Keyword(2) | Bottleneck feature |
Keyword(3) | Denoising autoencoder |
Keyword(4) | Speaker identification |
Keyword(5) | Dereverberation |
Keyword(6) | Feature transformation |
1st Author's Name | Zhaofeng Zhang |
1st Author's Affiliation | Nagaoka University of Technology() |
2nd Author's Name | Longbiao Wang |
2nd Author's Affiliation | Nagaoka University of Technology |
3rd Author's Name | Atsuhiko Kai |
3rd Author's Affiliation | Shizuoka University |
4th Author's Name | Weifeng Li |
4th Author's Affiliation | Tsinghua University |
5th Author's Name | Masahiro Iwahashi |
5th Author's Affiliation | Nagaoka University of Technology |
Date | 2014-12-16 |
Paper # | SP2014-119 |
Volume (vol) | vol.114 |
Number (no) | 365 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |