Paper Abstract and Keywords |
Presentation |
2009-10-30 09:45
An Evaluation of Statistical Voice Conversion in Speaking-Aid Systems Using External Sound Signals Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (NAIST) SP2009-57 WIT2009-63 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
This paper presents experimental evaluations of statistical voice conversion from artificial speech spoken by a laryngectomee using some external sound source units. The laryngectomee uses three kinds of the external sound source units; 1)~a sound source unit that generates quite small source signals that cannot be heard by listeners; 2)~a conventional electrolarynx that generates signals with monotone \(F_0\); and 3)~an electrolarynx using an air-pressure sensor that enables the laryngectomee to modulate the \(F_0\) of the sound source signal using the air-pressure output from his/her tracheostoma. The generated artificial speech is detected with a headset microphone or Non-audible murmur microphone, and then, it is converted into whispered voice or normal speech uttered by a nonlaryngectomee.
The experimental results demonstrate that 1)~the use of the air-pressure sensor is effective to improve the voice conversion accuracy, 2)~the modulated \(F_0\) using an air-pressure sensor is less effective in \(F_0\) estimation, but it doesn't cause any significant degradation of the voice conversion accuracy, and 3)~voice conversion yields significant improvements in naturalness so that the converted speech is much more preferred to the original artificial speech by listeners. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Laryngectomees / Electrolarynx / Air-pressure Sensor / Small Sound Source / Statistical Voice Conversion / / / |
Reference Info. |
IEICE Tech. Rep., vol. 109, no. 259, SP2009-57, pp. 49-54, Oct. 2009. |
Paper # |
SP2009-57 |
Date of Issue |
2009-10-22 (SP, WIT) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2009-57 WIT2009-63 |
Conference Information |
Committee |
WIT SP |
Conference Date |
2009-10-29 - 2009-10-30 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
ASPAM |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Welfare and speech processing, etc. |
Paper Information |
Registration To |
SP |
Conference Code |
2009-10-WIT-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
An Evaluation of Statistical Voice Conversion in Speaking-Aid Systems Using External Sound Signals |
Sub Title (in English) |
|
Keyword(1) |
Laryngectomees |
Keyword(2) |
Electrolarynx |
Keyword(3) |
Air-pressure Sensor |
Keyword(4) |
Small Sound Source |
Keyword(5) |
Statistical Voice Conversion |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Keigo Nakamura |
1st Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
2nd Author's Name |
Tomoki Toda |
2nd Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
3rd Author's Name |
Hiroshi Saruwatari |
3rd Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
4th Author's Name |
Kiyohiro Shikano |
4th Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2009-10-30 09:45:00 |
Presentation Time |
30 minutes |
Registration for |
SP |
Paper # |
SP2009-57, WIT2009-63 |
Volume (vol) |
vol.109 |
Number (no) |
no.259(SP), no.260(WIT) |
Page |
pp.49-54 |
#Pages |
6 |
Date of Issue |
2009-10-22 (SP, WIT) |
|