Paper Abstract and Keywords |
Presentation |
2019-03-15 13:30
[Poster Presentation]
Robustness of statistical voice conversion based on waveform modification against external noise Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda (Nagoya Univ.), Tomoki Toda (Nagoya Univ./JST PRESTO) EA2018-153 SIP2018-159 SP2018-115 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In this report, we investigate the statistical voice conversion (VC) under noisy environments.
VC achieves conversion from input speech to target speech by statistically modeling correspondence between input and target acoustic features.
To develop various VC applications, such as augmented speech production and augmented vocal production, it is necessary to handle noisy input speech because various background sounds, such as external noise and accompaniment, usually exist in a real environment.In this report, we investigate an impact of background sounds on conversion performance in singing voice conversion focusing on a vocoder-based conversion method and a vocoder-free conversion method based on direct waveform modification with log-spectral differential compensation (DIFFVC).Results of subjective evaluation show that DIFFVC is robust against background sounds compared with the vocoder-based conversion method.We also analyze the robustness of DIFFVC using a kurtosis ratio as an objective metric to evaluate distribution changes of power spectral components. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Statistical voice conversion / background sounds / vocoder / direct waveform modification / kurtosis ratio / / / |
Reference Info. |
IEICE Tech. Rep., vol. 118, no. 497, SP2018-115, pp. 317-322, March 2019. |
Paper # |
SP2018-115 |
Date of Issue |
2019-03-07 (EA, SIP, SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
EA2018-153 SIP2018-159 SP2018-115 |
Conference Information |
Committee |
EA SIP SP |
Conference Date |
2019-03-14 - 2019-03-15 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
i+Land nagasaki (Nagasaki-shi) |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Engineering/Electro Acoustics, Signal Processing, Speech, and Related Topics |
Paper Information |
Registration To |
SP |
Conference Code |
2019-03-EA-SIP-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Robustness of statistical voice conversion based on waveform modification against external noise |
Sub Title (in English) |
|
Keyword(1) |
Statistical voice conversion |
Keyword(2) |
background sounds |
Keyword(3) |
vocoder |
Keyword(4) |
direct waveform modification |
Keyword(5) |
kurtosis ratio |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Yusuke Kurita |
1st Author's Affiliation |
Nagoya University (Nagoya Univ.) |
2nd Author's Name |
Kazuhiro Kobayashi |
2nd Author's Affiliation |
Nagoya University (Nagoya Univ.) |
3rd Author's Name |
Kazuya Takeda |
3rd Author's Affiliation |
Nagoya University (Nagoya Univ.) |
4th Author's Name |
Tomoki Toda |
4th Author's Affiliation |
Nagoya University/JST PRESTO (Nagoya Univ./JST PRESTO) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2019-03-15 13:30:00 |
Presentation Time |
90 minutes |
Registration for |
SP |
Paper # |
EA2018-153, SIP2018-159, SP2018-115 |
Volume (vol) |
vol.118 |
Number (no) |
no.495(EA), no.496(SIP), no.497(SP) |
Page |
pp.317-322 |
#Pages |
6 |
Date of Issue |
2019-03-07 (EA, SIP, SP) |
|