Presentation 2022-03-08
A study on high-intelligibility speech synthesis of dysarthric speakers using voice conversion from normal speech and multi-speaker vocoder
Tetsuro Takano, Takashi Nose, Aoi Kanagaki, Satoshi Watanabe
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this study, we investigated whether intelligible synthetic speech can be generated by converting the voice of a normal speaker into that of a dysarthric speaker while preserving the speaker's voice quality. Exploiting the fact that a multi-speaker vocoder can produce clear synthetic speech even from a small amount of data from the impaired speaker, we demonstrated the effectiveness of three techniques: speech-rate conversion to improve voice similarity, pitch augmentation to overcome monotonous intonation, and fine-tuning on word-level data.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) dysarthria / voice conversion / multi-speaker vocoder / CycleGAN / HiFi-GAN
Paper # WIT2021-46
Date of Issue 2022-03-01 (WIT)

Conference Information
Committee WIT / IPSJ-AAC
Conference Date 2022/3/8(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Online
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Shinji Sakou(Nagoya Inst. of Tech.)
Vice Chair Tomohiro Amemiya(Univ. of Tokyo)
Secretary Tomohiro Amemiya(Saitama Industrial Tech. Center) / (Teikyo Univ.)
Assistant Minako Hosono(AIST) / Aki Sugano(Nagoya Univ.) / Tomoyasu Komori(NHK)

Paper Information
Registration To Technical Committee on Well-being Information Technology / Special Interest Group on Assistive & Accessible Computing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A study on high-intelligibility speech synthesis of dysarthric speakers using voice conversion from normal speech and multi-speaker vocoder
Sub Title (in English)
Keyword(1) dysarthria
Keyword(2) voice conversion
Keyword(3) multi-speaker vocoder
Keyword(4) CycleGAN
Keyword(5) HiFi-GAN
1st Author's Name Tetsuro Takano
1st Author's Affiliation Human Techno System Co., Ltd(HTS)
2nd Author's Name Takashi Nose
2nd Author's Affiliation Tohoku University(Tohoku Univ.)
3rd Author's Name Aoi Kanagaki
3rd Author's Affiliation Tohoku University(Tohoku Univ.)
4th Author's Name Satoshi Watanabe
4th Author's Affiliation Human Techno System Co., Ltd(HTS)
Date 2022-03-08
Paper # WIT2021-46
Volume (vol) vol.121
Number (no) WIT-418
Page pp.18-23(WIT)
#Pages 6
Date of Issue 2022-03-01 (WIT)