Presentation 2016-01-14
Pitch-synchronous band group delay vocoder for high quality speech synthesis
Masatsune Tamura, Ryo Morinaka, Masahiro Morita
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper presents a speech analysis and synthesis method that can precisely reconstruct speech waveforms for high-quality statistical parametric speech synthesis. The proposed method is based on pitch-synchronous analysis. The power spectrum, aperiodicity measure, pitch, and phase spectrum of each analysis frame are represented by mel LSP (MLSP), band aperiodicity (BAP), log fundamental frequency (LF0), and a new band group delay with compensation parameter (BGRDC), respectively. The BGRDC consists of a band group delay parameter, which represents the mean time of each frequency band, and a compensation parameter, which recovers the phase spectrum at the boundary of each band. We also propose a band group delay vocoder that enables fast generation of speech waveforms by using time-domain excitation generation and a vocal tract filter. Objective and subjective evaluations show that the proposed method can precisely generate speech waveforms.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Speech Analysis / Speech Synthesis / Vocoder / Pitch-synchronous analysis / Phase spectrum / Group delay
Paper # SP2015-91
Date of Issue 2016-01-07 (SP)
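
Illustrative note on the abstract: the band group delay parameter is described as the mean time of each frequency band. The following is a minimal sketch of that idea only, not the authors' implementation. It assumes a power-weighted average of the per-bin group delay within each band, and it does not model the compensation parameter. The function names, band edges, and FFT size are hypothetical.

```python
import numpy as np

def group_delay(frame, n_fft=512):
    """Per-bin group delay in samples, using the standard identity
    tau(w) = Re{ FFT(n * x[n]) / FFT(x[n]) }."""
    n = np.arange(len(frame))
    X = np.fft.rfft(frame, n_fft)
    Y = np.fft.rfft(n * frame, n_fft)
    # Small constant avoids division by zero at spectral nulls.
    return np.real(Y * np.conj(X)) / (np.abs(X) ** 2 + 1e-12)

def band_group_delay(frame, band_edges, n_fft=512):
    """Power-weighted mean group delay per band (assumed weighting).
    band_edges holds FFT-bin indices delimiting the bands."""
    tau = group_delay(frame, n_fft)
    power = np.abs(np.fft.rfft(frame, n_fft)) ** 2
    means = []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        w = power[lo:hi]
        means.append(np.sum(w * tau[lo:hi]) / (np.sum(w) + 1e-12))
    return np.array(means)

# Usage: an impulse delayed by 10 samples has a group delay of about
# 10 samples in every band.
frame = np.zeros(64)
frame[10] = 1.0
print(band_group_delay(frame, [0, 32, 64, 128, 257]))
```

The band edges above are arbitrary; a perceptual scale such as mel would be a natural choice, but the abstract does not specify one.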

Conference Information
Committee SP
Conference Date 2016/1/14 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English) Sunpian Kawasaki
Topics (in Japanese) (See Japanese page)
Topics (in English) Synthesis, Generation, Prosody, etc.
Chair Kazunori Mano(Shibaura Inst. of Tech.)
Vice Chair Norihide Kitaoka(Tokushima Univ.)
Secretary Norihide Kitaoka(Tokyo City Univ.)
Assistant Takashi Nose(Tohoku Univ.) / Taichi Asami(NTT)

Paper Information
Registration To Technical Committee on Speech
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Pitch-synchronous band group delay vocoder for high quality speech synthesis
Sub Title (in English)
Keyword(1) Speech Analysis
Keyword(2) Speech Synthesis
Keyword(3) Vocoder
Keyword(4) Pitch-synchronous analysis
Keyword(5) Phase spectrum
Keyword(6) Group delay
1st Author's Name Masatsune Tamura
1st Author's Affiliation Toshiba Corporation(Toshiba)
2nd Author's Name Ryo Morinaka
2nd Author's Affiliation Toshiba Corporation(Toshiba)
3rd Author's Name Masahiro Morita
3rd Author's Affiliation Toshiba Corporation(Toshiba)
Date 2016-01-14
Paper # SP2015-91
Volume (vol) vol.115
Number (no) SP-392
Page pp.33-38(SP)
#Pages 6
Date of Issue 2016-01-07 (SP)