Presentation | 2018-01-21 Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper proposes a mel-cepstrum based quantization noise shaping for improving the quality of synthetic speech generated by neural network based speech waveform synthesis systems. Since mel-cepstral coefficients closely match the characteristics of human auditory perception, it is expected that the proposed method effectively masks the white noise introduced by the quantization typically used in neural network based speech waveform synthesis systems. The paper also describes a mel-cepstrum based prefiltering to further mask the quantization noise. Experiments using the WaveNet generative model showed that speech quality is significantly improved by the proposed method. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech synthesis / noise shaping / quantization / mel-cepstrum / WaveNet |
Paper # | SP2017-83 |
Date of Issue | 2018-01-13 (SP) |
Conference Information | |
Committee | SP / ASJ-H |
---|---|
Conference Date | 2018/1/20 (2 days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | The University of Tokyo |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Yoichi Yamashita(Ritsumeikan Univ.) / Tatsuya Hirahara(Toyama Prefectural Univ.) |
Vice Chair | Hiroki Mori(Utsunomiya Univ.) / Seiji Nakagawa(Chiba Univ.) |
Secretary | Hiroki Mori(Shizuoka Univ.) / Seiji Nakagawa(Meijo Univ.) |
Assistant | Kei Hashimoto(Nagoya Inst. of Tech.) / Satoshi Kobashikawa(NTT) |
Paper Information | |
Registration To | Technical Committee on Speech / Auditory Research Meeting |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet |
Sub Title (in English) | |
Keyword(1) | speech synthesis |
Keyword(2) | noise shaping |
Keyword(3) | quantization |
Keyword(4) | mel-cepstrum |
Keyword(5) | WaveNet |
1st Author's Name | Takenori Yoshimura |
1st Author's Affiliation | Nagoya Institute of Technology(Nagoya Inst. of Tech.) |
2nd Author's Name | Kei Hashimoto |
2nd Author's Affiliation | Nagoya Institute of Technology(Nagoya Inst. of Tech.) |
3rd Author's Name | Keiichiro Oura |
3rd Author's Affiliation | Nagoya Institute of Technology(Nagoya Inst. of Tech.) |
4th Author's Name | Yoshihiko Nankaku |
4th Author's Affiliation | Nagoya Institute of Technology(Nagoya Inst. of Tech.) |
5th Author's Name | Keiichi Tokuda |
5th Author's Affiliation | Nagoya Institute of Technology(Nagoya Inst. of Tech.) |
Date | 2018-01-21 |
Paper # | SP2017-83 |
Volume (vol) | vol.117 |
Number (no) | SP-393 |
Page | pp.93-98(SP) |
#Pages | 6 |
Date of Issue | 2018-01-13 (SP) |