Presentation 2023-03-03
Investigation of introducing data augmentation methods to improve speech enhancement performance
Reito Kasuga, Yosuke Sugiura, Nozomiko Yasui, Tetsuya Shimamura,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The field of speech enhancement has been extensively researched worldwide, and many speech enhancement methods have been proposed. However, there are still few large-scale speech datasets that can be used to train high-performance speech enhancement networks, making it difficult to create general-purpose models and making overlearning more likely to occur. In order to solve this problem, a method that enables the creation of large data sets from small data sets and improves the generalization performance of models is considered effective. In this paper, we attempt to solve the data size problem and improve the performance of speech enhancement by introducing a data expansion method called SpecAugment, which has demonstrated excellent performance in the field of speech recognition, to speech enhancement networks.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech enhancement / overlearning / speech recognition / SpecAugment / data augmentation
Paper # SIS2022-52
Date of Issue 2023-02-23 (SIS)

Conference Information
Committee SIS
Conference Date 2023/3/2(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Chiba Institute of Technology
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Tomoaki Kimura(Kanagawa Inst. of Tech.)
Vice Chair Naoto Sasaoka(Tottori Univ.) / Hakaru Tamukoh(Kyushu Inst. of Tech.)
Secretary Naoto Sasaoka(NTT) / Hakaru Tamukoh(Kansai Univ.)
Assistant Yoshiaki Makabe(Kanagawa Inst. of Tech.) / Yosuke Sugiura(Saitama Univ.)

Paper Information
Registration To Technical Committee on Smart Info-Media Systems
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Investigation of introducing data augmentation methods to improve speech enhancement performance
Sub Title (in English)
Keyword(1) speech enhancement
Keyword(2) overlearning
Keyword(3) speech recognition
Keyword(4) SpecAugment
Keyword(5) data augmentation
1st Author's Name Reito Kasuga
1st Author's Affiliation Saitama University(Saitama Univ.)
2nd Author's Name Yosuke Sugiura
2nd Author's Affiliation Saitama University(Saitama Univ.)
3rd Author's Name Nozomiko Yasui
3rd Author's Affiliation Saitama University(Saitama Univ.)
4th Author's Name Tetsuya Shimamura
4th Author's Affiliation Saitama University(Saitama Univ.)
Date 2023-03-03
Paper # SIS2022-52
Volume (vol) vol.122
Number (no) SIS-410
Page pp.pp.64-69(SIS),
#Pages 6
Date of Issue 2023-02-23 (SIS)