Presentation 2018-01-21
Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet
Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper proposes a mel-cepstrum based quantization noise shaping for improving the quality of synthetic speech generated by neural network based speech waveform synthesis systems. Since mel-cepstral coefficients closely match the characteristics of human auditory perception, it is expected that the proposed method effectively masks the white noise introduced by the quantization typically used in neural network based speech waveform synthesis systems. The paper also describes a mel-cepstrum based prefiltering to further mask the quantization noise. Experiments using the WaveNet generative model showed that speech quality is significantly improved by the proposed method.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech synthesis / noise shaping / quantization / mel-cepstrum / WaveNet
Paper # SP2017-83
Date of Issue 2018-01-13 (SP)

Conference Information
Committee SP / ASJ-H
Conference Date 2018/1/20(2days)
Place (in Japanese) (See Japanese page)
Place (in English) The University of Tokyo
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Yoichi Yamashita(Ritsumeikan Univ.) / 平原 達也(富山県立大)
Vice Chair Hiroki Mori(Utsunomiya Univ.) / 中川 誠司(千葉大)
Secretary Hiroki Mori(Shizuoka Univ.) / 中川 誠司(Meijo Univ.)
Assistant Kei Hashimoto(Nagoya Inst. of Tech.) / Satoshi Kobashikawa(NTT)

Paper Information
Registration To Technical Committee on Speech / Auditory Research Meeting
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet
Sub Title (in English)
Keyword(1) speech synthesis
Keyword(2) noise shaping
Keyword(3) quantization
Keyword(4) mel-cepstrum
Keyword(5) WaveNet
1st Author's Name Takenori Yoshimura
1st Author's Affiliation Nagoya Institute of Technology(Nagoya Inst. of Tech.)
2nd Author's Name Kei Hashimoto
2nd Author's Affiliation Nagoya Institute of Technology(Nagoya Inst. of Tech.)
3rd Author's Name Keiichiro Oura
3rd Author's Affiliation Nagoya Institute of Technology(Nagoya Inst. of Tech.)
4th Author's Name Yoshihiko Nankaku
4th Author's Affiliation Nagoya Institute of Technology(Nagoya Inst. of Tech.)
5th Author's Name Keiichi Tokuda
5th Author's Affiliation Nagoya Institute of Technology(Nagoya Inst. of Tech.)
Date 2018-01-21
Paper # SP2017-83
Volume (vol) vol.117
Number (no) SP-393
Page pp.pp.93-98(SP),
#Pages 6
Date of Issue 2018-01-13 (SP)