Presentation 2022-10-13
Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks
Reito Kasuga, Tetsuya Shimamura, Yosuke Sugiura, Nozomiko Yasui,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Although the field of speech enhancement has been extensively studied around the world, phase tends to be neglected compared to amplitude and frequency among the basic quantities handled in speech signal processing. This is because it was believed that the contribution of phase to speech quality was small, based on the perception that human hearing is insensitive to changes in phase. However, with the development of speech signal processing, the importance of phase to speech quality has become clear. In this paper, we introduce the capsule structure of the Capsule Network, which has shown excellent performance in the field of image recognition in recent years, to the speech enhancement network, and attempt to improve the performance of the speech enhancement network and the naturalness of speech by constructing a speech enhancement model that also focuses on phase information.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech enhancement / phase / speech quality / Capsule Network / capsule structure / naturalness of speech
Paper # SIS2022-12
Date of Issue 2022-10-06 (SIS)

Conference Information
Committee SIS / ITE-BCT
Conference Date 2022/10/13(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Hachinohe Institute of Technology
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Tomoaki Kimura(Kanagawa Inst. of Tech.) / 斎藤 恭一(NHK)
Vice Chair Naoto Sasaoka(Tottori Univ.) / Hakaru Tamukoh(Kyushu Inst. of Tech.) / 村田 英一(山口大) / 斉藤 一(テレビ東京)
Secretary Naoto Sasaoka(NTT) / Hakaru Tamukoh(Kansai Univ.) / 村田 英一(千葉大) / 斉藤 一
Assistant Yoshiaki Makabe(Kanagawa Inst. of Tech.) / Yosuke Sugiura(Saitama Univ.) / 神原 浩平(NHK) / 鈴村 高幸(テレビ朝日) / 松﨑 敬文(NHK) / 宮野 真由子(東芝インフラシステムズ) / 大内 幹博(パナソニック) / 榎 芳栄(TBSテレビ) / 水本 哲弥(日本学術振興会)

Paper Information
Registration To Technical Committee on Smart Info-Media Systems / Technical Group on Broadcasting Technology
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks
Sub Title (in English)
Keyword(1) speech enhancement
Keyword(2) phase
Keyword(3) speech quality
Keyword(4) Capsule Network
Keyword(5) capsule structure
Keyword(6) naturalness of speech
1st Author's Name Reito Kasuga
1st Author's Affiliation Saitama University(Saitama Univ.)
2nd Author's Name Tetsuya Shimamura
2nd Author's Affiliation Saitama University(Saitama Univ.)
3rd Author's Name Yosuke Sugiura
3rd Author's Affiliation Saitama University(Saitama Univ.)
4th Author's Name Nozomiko Yasui
4th Author's Affiliation Saitama University(Saitama Univ.)
Date 2022-10-13
Paper # SIS2022-12
Volume (vol) vol.122
Number (no) SIS-209
Page pp.pp.7-12(SIS),
#Pages 6
Date of Issue 2022-10-06 (SIS)