Paper Abstract and Keywords |
Presentation |
2012-06-14 16:00
Perceptual evaluation of synthesized speech reflecting "personalities" Minoru Tsuzaki (KCUA), Keiichi Tokuda (NITEC), Hisashi Kawai (KDDI R&D Labs), Yoshinori Shiga, Jinfu Ni (NICT), Keiichiro Oura, Sayaka Shiota (NITEC) SP2012-39 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Perceptual evaluation tests were performed for talker selection methods in the application of the speaker adaptation framework in an HMM speech synthesis technique. The speaker adaptation was tried to afford the personality of input Japanese utterances in synthesizing English utterances. Three selection methods as follows were evaluated: (a) choosing an acoustic model in the GMM built for the English corpus on the basis of the maximum likelihood to the input Japanese voice, (b) choosing by the weighted interpolation in the English space with the reference points of the bilingual speakers, (c) choosing by the multiple linear prediction using the auditory parameters estimated for the perceptual space of the bilingual speakers. Two types of perceptual tests were carried out. The first one was to ask listeners to choose one of the paired Japanese utterances which was heard to be "mimicked" by the English synthesized utterance. The second one was to ask listeners to choose one of the paired synthesized English utterances which was heard to "mimic" the Japanese natural utterance. The performances of all the selection methods were significantly above the chance in both tasks, except for the type (c) selection in the second task. However, the performance levels were not so high, which implies that further improvement will be required. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
speech synthesis / HMM synthesis / speaker adaptation / perceptual evaluation / personality / bilingual corpora / / |
Reference Info. |
IEICE Tech. Rep., vol. 112, no. 81, SP2012-39, pp. 33-38, June 2012. |
Paper # |
SP2012-39 |
Date of Issue |
2012-06-07 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2012-39 |
|