Presentation | 2019-03-15 [Poster Presentation] Robustness of statistical voice conversion based on waveform modification against external noise Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this report, we investigate the statistical voice conversion (VC) under noisy environments. VC achieves conversion from input speech to target speech by statistically modeling correspondence between input and target acoustic features. To develop various VC applications, such as augmented speech production and augmented vocal production, it is necessary to handle noisy input speech because various background sounds, such as external noise and accompaniment, usually exist in a real environment.In this report, we investigate an impact of background sounds on conversion performance in singing voice conversion focusing on a vocoder-based conversion method and a vocoder-free conversion method based on direct waveform modification with log-spectral differential compensation (DIFFVC).Results of subjective evaluation show that DIFFVC is robust against background sounds compared with the vocoder-based conversion method.We also analyze the robustness of DIFFVC using a kurtosis ratio as an objective metric to evaluate distribution changes of power spectral components. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Statistical voice conversion / background sounds / vocoder / direct waveform modification / kurtosis ratio |
Paper # | EA2018-153,SIP2018-159,SP2018-115 |
Date of Issue | 2019-03-07 (EA, SIP, SP) |
Conference Information | |
Committee | EA / SIP / SP |
---|---|
Conference Date | 2019/3/14(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | i+Land nagasaki (Nagasaki-shi) |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Engineering/Electro Acoustics, Signal Processing, Speech, and Related Topics |
Chair | Suehiro Shimauchi(Kanazawa Inst. of Tech.) / Shogo Muramatsu(Niigata Univ.) / Yoichi Yamashita(Ritsumeikan Univ.) |
Vice Chair | Kenichi Furuya(Oita Univ.) / Kanji Watanabe(Akita Pref. Univ.) / Naoyuki Aikawa(TUS) / Kazunori Hayashi(Osaka City Univ) / Akinobu Ri(Nagoya Inst. of Tech.) |
Secretary | Kenichi Furuya(Shizuoka Inst. of Science and Tech.) / Kanji Watanabe(NHK) / Naoyuki Aikawa(Takushoku Univ.) / Kazunori Hayashi(Hiroshima Univ.) / Akinobu Ri(Kyoto Univ.) |
Assistant | Keisuke Imoto(Ritsumeikan Univ.) / Daisuke Morikawa(Toyama Pref Univ.) / Katsumi Konishi(Hosei Univ.) / hyihsin(Takushoku Univ.) / Tomoki Koriyama(Tokyo Inst. of Tech.) / Satoshi Kobashikawa(NTT) |
Paper Information | |
Registration To | Technical Committee on Engineering Acoustics / Technical Committee on Signal Processing / Technical Committee on Speech |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Robustness of statistical voice conversion based on waveform modification against external noise |
Sub Title (in English) | |
Keyword(1) | Statistical voice conversion |
Keyword(2) | background sounds |
Keyword(3) | vocoder |
Keyword(4) | direct waveform modification |
Keyword(5) | kurtosis ratio |
1st Author's Name | Yusuke Kurita |
1st Author's Affiliation | Nagoya University(Nagoya Univ.) |
2nd Author's Name | Kazuhiro Kobayashi |
2nd Author's Affiliation | Nagoya University(Nagoya Univ.) |
3rd Author's Name | Kazuya Takeda |
3rd Author's Affiliation | Nagoya University(Nagoya Univ.) |
4th Author's Name | Tomoki Toda |
4th Author's Affiliation | Nagoya University/JST PRESTO(Nagoya Univ./JST PRESTO) |
Date | 2019-03-15 |
Paper # | EA2018-153,SIP2018-159,SP2018-115 |
Volume (vol) | vol.118 |
Number (no) | EA-495,SIP-496,SP-497 |
Page | pp.pp.317-322(EA), pp.317-322(SIP), pp.317-322(SP), |
#Pages | 6 |
Date of Issue | 2019-03-07 (EA, SIP, SP) |