メルケプストラムに基づくノイズシェーピング量子化法のWaveNet音声合成への適用

Presentation	2018-01-21 Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper proposes a mel-cepstrum based quantization noise shaping for improving the quality of synthetic speech generated by neural network based speech waveform synthesis systems. Since mel-cepstral coefficients closely match the characteristics of human auditory perception, it is expected that the proposed method effectively masks the white noise introduced by the quantization typically used in neural network based speech waveform synthesis systems. The paper also describes a mel-cepstrum based prefiltering to further mask the quantization noise. Experiments using the WaveNet generative model showed that speech quality is significantly improved by the proposed method.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	speech synthesis / noise shaping / quantization / mel-cepstrum / WaveNet
Paper #	SP2017-83
Date of Issue	2018-01-13 (SP)

Conference Information
Committee	SP / ASJ-H
Conference Date	2018/1/20(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	The University of Tokyo
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Yoichi Yamashita(Ritsumeikan Univ.) / 平原達也(富山県立大)
Vice Chair	Hiroki Mori(Utsunomiya Univ.) / 中川誠司(千葉大)
Secretary	Hiroki Mori(Shizuoka Univ.) / 中川誠司(Meijo Univ.)
Assistant	Kei Hashimoto(Nagoya Inst. of Tech.) / Satoshi Kobashikawa(NTT)

Paper Information
Registration To	Technical Committee on Speech / Auditory Research Meeting
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet
Sub Title (in English)
Keyword(1)	speech synthesis
Keyword(2)	noise shaping
Keyword(3)	quantization
Keyword(4)	mel-cepstrum
Keyword(5)	WaveNet
1st Author's Name	Takenori Yoshimura
1st Author's Affiliation	Nagoya Institute of Technology(Nagoya Inst. of Tech.)
2nd Author's Name	Kei Hashimoto
2nd Author's Affiliation	Nagoya Institute of Technology(Nagoya Inst. of Tech.)
3rd Author's Name	Keiichiro Oura
3rd Author's Affiliation	Nagoya Institute of Technology(Nagoya Inst. of Tech.)
4th Author's Name	Yoshihiko Nankaku
4th Author's Affiliation	Nagoya Institute of Technology(Nagoya Inst. of Tech.)
5th Author's Name	Keiichi Tokuda
5th Author's Affiliation	Nagoya Institute of Technology(Nagoya Inst. of Tech.)
Date	2018-01-21
Paper #	SP2017-83
Volume (vol)	vol.117
Number (no)	SP-393
Page	pp.pp.93-98(SP),
#Pages	6
Date of Issue	2018-01-13 (SP)