Presentation | 2018-03-20 A Study on Structure of Deep Neural Network for Speech Enhancement Yosuke Sugiura, Tetsuya Shimamura, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we study a structure of a deep neural network for speech enhancement.In speech enhancement, it is a big issue to suppress the noise in the silent interval and maintain a spectral envelope in the voiced interval, simultaneously. Because of a large variability of the amplitude spectrum, which is one of the features, and the complexity of the regression problems, the network has an under-fitting and then the speech quality is degraded. This paper, thus, relaxes the complexity of the problem by constraining a solution space to optimize an spectral gain of speech enhancement. Additionally, this paper tries to improve the speech enhancement performance by designing a loss function appropriately according to the distribution of the speech spectrum. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Speech Enhancement / Deep Neural Network |
Paper # | EA2017-171,SIP2017-180,SP2017-154 |
Date of Issue | 2018-03-12 (EA, SIP, SP) |
Conference Information | |
Committee | SIP / EA / SP / MI |
---|---|
Conference Date | 2018/3/19(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Speech, Engineering/Electro Acoustics, Signal Processing, and Related Topics [SIP, EA, SP]/ Medical Image Engineering, Analysis, Recognition, etc. [MI] |
Chair | Masahiro Okuda(Univ. of Kitakyushu) / Suehiro Shimauchi(NTT) / Yoichi Yamashita(Ritsumeikan Univ.) / Kensaku Mori(Nagoya Univ.) |
Vice Chair | Shogo Muramatsu(Niigata Univ.) / Naoyuki Aikawa(TUS) / Mitsunori Mizumachi(Kyutech) / Hiroki Mori(Utsunomiya Univ.) / Yoshiki Kawata(Tokushima Univ.) / Yuichi Kimura(Kinki Univ.) |
Secretary | Shogo Muramatsu(Chiba Inst. of Tech.) / Naoyuki Aikawa(Takushoku Univ.) / Mitsunori Mizumachi(Akita Pref. Univ.) / Hiroki Mori(Shizuoka Inst. of Science and Tech.) / Yoshiki Kawata(Shizuoka Univ.) / Yuichi Kimura(Meijo Univ.) |
Assistant | Masayoshi Nakamoto(Hiroshima Univ.ひろ) / TREVINO Jorge(Tohoku Univ.) / Nobutaka Ito(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / Satoshi Kobashikawa(NTT) / Ryo Haraguchi(Univ. of Hyogo) / Yasushi Hirano(Yamaguchi Univ.) |
Paper Information | |
Registration To | Technical Committee on Signal Processing / Technical Committee on Engineering Acoustics / Technical Committee on Speech / Technical Committee on Medical Imaging |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Study on Structure of Deep Neural Network for Speech Enhancement |
Sub Title (in English) | |
Keyword(1) | Speech Enhancement |
Keyword(2) | Deep Neural Network |
1st Author's Name | Yosuke Sugiura |
1st Author's Affiliation | Saitama University(Saitama Univ.) |
2nd Author's Name | Tetsuya Shimamura |
2nd Author's Affiliation | Saitama University(Saitama Univ.) |
Date | 2018-03-20 |
Paper # | EA2017-171,SIP2017-180,SP2017-154 |
Volume (vol) | vol.117 |
Number (no) | EA-515,SIP-516,SP-517 |
Page | pp.pp.379-384(EA), pp.379-384(SIP), pp.379-384(SP), |
#Pages | 6 |
Date of Issue | 2018-03-12 (EA, SIP, SP) |