Paper Abstract and Keywords |
Presentation |
2009-06-25 14:30
A mean F0 speaker adaptation method for regression model-based F0 contour generation Hosana Kamiyama, Takahiro Shinozaki (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.), Sadaoki Furui (Tokyo Inst. of Tech.) SP2009-38 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
This paper proposes a new speaker adaptation method for fundamental frequency ($F_0$) contour generation models based on Quantification Theory (Type I). In this method, models that produce natural $F_0$ contours for standard Japanese are trained on a large amount of speech data from many speakers; natural, speaker-specific $F_0$ contours are then generated by adapting the mean $F_0$ values using a small amount of speech data from a target speaker. Objective evaluation of models built with the proposed method confirms that around five sentences are enough for speaker adaptation. Subjective evaluation confirms that the naturalness of speech synthesized with models adapted using 50 sentences is almost equivalent to that of speech synthesized with models trained on 450 sentences from a specific speaker. These results indicate that the proposed adaptation method can produce highly natural synthesized speech. |
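The mean-adaptation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Quantification Theory (Type I) style model in which a predicted value is a mean plus a sum of per-category scores, that the scores were already trained on multi-speaker data, and that only the mean is re-estimated from the target speaker's sentences. The factor names, score values, the use of log $F_0$, and the function names are all hypothetical.

```python
from statistics import fmean

# Hypothetical category scores (log-F0 offsets), standing in for a model
# trained on pooled multi-speaker data. Factor and level names are invented.
SCORES = {
    "accent_type": {"type0": -0.05, "type1": 0.10, "type2": 0.02},
    "position": {"initial": 0.08, "medial": 0.00, "final": -0.12},
}


def category_offset(features, scores=SCORES):
    """Sum the score of the active level of every categorical factor."""
    return sum(scores[factor][level] for factor, level in features.items())


def predict_log_f0(features, mean_log_f0, scores=SCORES):
    """Type I style prediction: a mean plus additive category scores."""
    return mean_log_f0 + category_offset(features, scores)


def adapt_mean(adaptation_data, scores=SCORES):
    """Re-estimate only the mean from (features, observed_log_f0) pairs.

    The category scores stay fixed; the new mean is the average residual
    after the category offsets are removed (an assumption of this sketch).
    """
    residuals = [obs - category_offset(feat, scores) for feat, obs in adaptation_data]
    return fmean(residuals)


# A few synthetic "sentences" from a target speaker whose mean log-F0 is 5.3.
target_data = [
    ({"accent_type": "type1", "position": "initial"}, 5.3 + 0.10 + 0.08),
    ({"accent_type": "type0", "position": "final"}, 5.3 - 0.05 - 0.12),
    ({"accent_type": "type2", "position": "medial"}, 5.3 + 0.02 + 0.00),
]
adapted_mean = adapt_mean(target_data)
```

Because only one parameter is estimated per speaker in this sketch, a handful of adaptation samples is enough to pin it down, which is consistent in spirit with the small adaptation sets reported in the abstract.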
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
HMM-based Speech Synthesis / Quantification Theory (Type I) / F0 Contour Generation / Prosody Control / Speaker Adaptation |
Reference Info. |
IEICE Tech. Rep., vol. 109, no. 99, SP2009-38, pp. 87-92, June 2009. |
Paper # |
SP2009-38 |
Date of Issue |
2009-06-17 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
SP |
Conference Date |
2009-06-24 - 2009-06-25 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Clark Memorial Hall, Hokkaido Univ. |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
emotional speech, prosody, voice quality, speech production and perception, brain activity, etc. |
Paper Information |
Registration To |
SP |
Conference Code |
2009-06-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
A mean F0 speaker adaptation method for regression model-based F0 contour generation |
Keyword(1) |
HMM-based Speech Synthesis |
Keyword(2) |
Quantification Theory (Type I) |
Keyword(3) |
F0 Contour Generation |
Keyword(4) |
Prosody Control |
Keyword(5) |
Speaker Adaptation |
1st Author's Name |
Hosana Kamiyama |
1st Author's Affiliation |
Tokyo Institute of Technology (Tokyo Inst. of Tech.) |
2nd Author's Name |
Takahiro Shinozaki |
2nd Author's Affiliation |
Tokyo Institute of Technology (Tokyo Inst. of Tech.) |
3rd Author's Name |
Koji Iwano |
3rd Author's Affiliation |
Tokyo City University (Tokyo City Univ.) |
4th Author's Name |
Sadaoki Furui |
4th Author's Affiliation |
Tokyo Institute of Technology (Tokyo Inst. of Tech.) |
Speaker |
Author-1 |
Date Time |
2009-06-25 14:30:00 |
Presentation Time |
30 minutes |
Registration for |
SP |
Paper # |
SP2009-38 |
Volume (vol) |
vol.109 |
Number (no) |
no.99 |
Page |
pp.87-92 |
#Pages |
6 |
Date of Issue |
2009-06-17 (SP) |
|