Paper Abstract and Keywords |
Presentation |
2013-05-17 09:45
Fast speech waveform generation using subband coding for speech synthesis Nobuyuki Nishizawa, Tsuneo Kato (KDDI Labs) EA2013-15 SIP2013-15 SP2013-15 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
For fast waveform generation in HMM-based speech synthesizers, a new method using a subband coding method that is also used in the MPEG Audio is proposed in this study. The proposed method is based on a filter bank-based speech synthesis where spectral features are built by combination of scaled band-decomposed source waveforms, and performed on a subband coding system with pre-decomposed and pre-coded source waveforms to reduce computational cost. Furthermore, sinusoidal synthesis directly performed on the subband coding system is also introduced to improve accuracy of spectral feature reproduction mainly in low frequency bands. Thus, no encoding part is necessary in speech synthesizers of the proposed method. In addition, a computation method for spectrum from cepstrum using discrete cosine transformation (DCT) used in the decoder of the subband coding is explained. The result of a subjective evaluation for resynthesized sounds showed that the mean opinion scores of sounds by the proposed method are superior to those by the conventional method using a mel log spectrum approximation (MLSA) filter. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
HMM-based speech synthesis / speech waveform generation / filter bank / subband coding / embedded systems / / / |
Reference Info. |
IEICE Tech. Rep., vol. 113, no. 29, SP2013-15, pp. 85-90, May 2013. |
Paper # |
SP2013-15 |
Date of Issue |
2013-05-09 (EA, SIP, SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
EA2013-15 SIP2013-15 SP2013-15 |
Conference Information |
Committee |
SP EA SIP |
Conference Date |
2013-05-16 - 2013-05-17 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
|
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Speech and Acoustic Signal Processing, Speech, and Related Topics |
Paper Information |
Registration To |
SP |
Conference Code |
2013-05-SP-EA-SIP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Fast speech waveform generation using subband coding for speech synthesis |
Sub Title (in English) |
|
Keyword(1) |
HMM-based speech synthesis |
Keyword(2) |
speech waveform generation |
Keyword(3) |
filter bank |
Keyword(4) |
subband coding |
Keyword(5) |
embedded systems |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Nobuyuki Nishizawa |
1st Author's Affiliation |
KDDI R&D Laboratories, Inc. (KDDI Labs) |
2nd Author's Name |
Tsuneo Kato |
2nd Author's Affiliation |
KDDI R&D Laboratories, Inc. (KDDI Labs) |
3rd Author's Name |
|
3rd Author's Affiliation |
() |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2013-05-17 09:45:00 |
Presentation Time |
25 minutes |
Registration for |
SP |
Paper # |
EA2013-15, SIP2013-15, SP2013-15 |
Volume (vol) |
vol.113 |
Number (no) |
no.27(EA), no.28(SIP), no.29(SP) |
Page |
pp.85-90 |
#Pages |
6 |
Date of Issue |
2013-05-09 (EA, SIP, SP) |
|