Presentation | 2022-10-13 Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks Reito Kasuga, Tetsuya Shimamura, Yosuke Sugiura, Nozomiko Yasui, |
---|---|
PDF Download Page | ![]() |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Although the field of speech enhancement has been extensively studied around the world, phase tends to be neglected compared to amplitude and frequency among the basic quantities handled in speech signal processing. This is because it was believed that the contribution of phase to speech quality was small, based on the perception that human hearing is insensitive to changes in phase. However, with the development of speech signal processing, the importance of phase to speech quality has become clear. In this paper, we introduce the capsule structure of the Capsule Network, which has shown excellent performance in the field of image recognition in recent years, to the speech enhancement network, and attempt to improve the performance of the speech enhancement network and the naturalness of speech by constructing a speech enhancement model that also focuses on phase information. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech enhancement / phase / speech quality / Capsule Network / capsule structure / naturalness of speech |
Paper # | SIS2022-12 |
Date of Issue | 2022-10-06 (SIS) |
Conference Information | |
Committee | SIS / ITE-BCT |
---|---|
Conference Date | 2022/10/13(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Hachinohe Institute of Technology |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Tomoaki Kimura(Kanagawa Inst. of Tech.) / 斎藤 恭一(NHK) |
Vice Chair | Naoto Sasaoka(Tottori Univ.) / Hakaru Tamukoh(Kyushu Inst. of Tech.) / 村田 英一(山口大) / 斉藤 一(テレビ東京) |
Secretary | Naoto Sasaoka(NTT) / Hakaru Tamukoh(Kansai Univ.) / 村田 英一(千葉大) / 斉藤 一 |
Assistant | Yoshiaki Makabe(Kanagawa Inst. of Tech.) / Yosuke Sugiura(Saitama Univ.) / 神原 浩平(NHK) / 鈴村 高幸(テレビ朝日) / 松﨑 敬文(NHK) / 宮野 真由子(東芝インフラシステムズ) / 大内 幹博(パナソニック) / 榎 芳栄(TBSテレビ) / 水本 哲弥(日本学術振興会) |
Paper Information | |
Registration To | Technical Committee on Smart Info-Media Systems / Technical Group on Broadcasting Technology |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks |
Sub Title (in English) | |
Keyword(1) | speech enhancement |
Keyword(2) | phase |
Keyword(3) | speech quality |
Keyword(4) | Capsule Network |
Keyword(5) | capsule structure |
Keyword(6) | naturalness of speech |
1st Author's Name | Reito Kasuga |
1st Author's Affiliation | Saitama University(Saitama Univ.) |
2nd Author's Name | Tetsuya Shimamura |
2nd Author's Affiliation | Saitama University(Saitama Univ.) |
3rd Author's Name | Yosuke Sugiura |
3rd Author's Affiliation | Saitama University(Saitama Univ.) |
4th Author's Name | Nozomiko Yasui |
4th Author's Affiliation | Saitama University(Saitama Univ.) |
Date | 2022-10-13 |
Paper # | SIS2022-12 |
Volume (vol) | vol.122 |
Number (no) | SIS-209 |
Page | pp.pp.7-12(SIS), |
#Pages | 6 |
Date of Issue | 2022-10-06 (SIS) |