[Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech

Presentation	2016-08-24 [Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech Li Li, Hirokazu Kameoka, Takuya Higuchi, Hiroshi Saruwatari, Shoji Makino
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	While spectral-domain speech enhancement algorithms based on non-negative matrix factorization (NMF) are powerful in terms of signal recovery accuracy (e.g., signal-to-noise ratio), they do not necessarily improve the quality of the enhanced speech in the feature domain. This implies that naively using these algorithms as front-end processing for tasks such as speech recognition and speech conversion does not always lead to satisfactory results. To address this problem, this paper proposes a novel method that jointly enhances the spectral and cepstral sequences of noisy speech by optimizing a combined objective function consisting of an NMF-based model-fitting criterion defined in the spectral domain and a Gaussian mixture model (GMM)-based probability distribution defined in the cepstral domain.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	speech enhancement / Gaussian mixture model / non-negative matrix factorization / mel-frequency cepstral coefficients / majorization-minimization
Paper #	SP2016-32
Date of Issue	2016-08-17 (SP)

Conference Information
Committee	SP
Conference Date	2016-08-24 (2 days)
Place (in Japanese)	(See Japanese page)
Place (in English)	ACCMS, Kyoto Univ.
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Audio event processing, etc.
Chair	Kazunori Mano(Shibaura Inst. of Tech.)
Vice Chair	Hiroki Mori(Utsunomiya Univ.)
Secretary	Hiroki Mori(Kobe Univ.)
Assistant	Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.)

Paper Information
Registration To	Technical Committee on Speech
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	[Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech
Sub Title (in English)
Keyword(1)	speech enhancement
Keyword(2)	Gaussian mixture model
Keyword(3)	non-negative matrix factorization
Keyword(4)	mel-frequency cepstral coefficients
Keyword(5)	majorization-minimization
1st Author's Name	Li Li
1st Author's Affiliation	University of Tsukuba(Univ.Tsukuba)
2nd Author's Name	Hirokazu Kameoka
2nd Author's Affiliation	Nippon Telegraph and Telephone Corporation(NTT)
3rd Author's Name	Takuya Higuchi
3rd Author's Affiliation	Nippon Telegraph and Telephone Corporation(NTT)
4th Author's Name	Hiroshi Saruwatari
4th Author's Affiliation	University of Tokyo(Univ.Tokyo)
5th Author's Name	Shoji Makino
5th Author's Affiliation	University of Tsukuba(Univ.Tsukuba)
Date	2016-08-24
Volume (vol)	vol.116
Number (no)	SP-189
Page	pp.29-32 (SP)
#Pages	4