Presentation 2016-08-24
[Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech
Li Li, Hirokazu Kameoka, Takuya Higuchi, Hiroshi Saruwatari, Shoji Makino,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) While spectral domain speech enhancement algorithms using non-negative matrix factorization (NMF) are powerful in terms of signal recovery accuracy (e.g., signal-to-noise ratio), they do not necessarily lead to an improvement in the quality of the enhanced speech in the feature domain. This implies that naively using these algorithms as front-end processing for e.g., speech recognition and speech conversion does not always lead to satisfactory results. To address this problem, this paper proposes a novel method that aims to jointly enhance the spectral and cepstral sequences of noisy speech, by optimizing a combined objective function consisting of an NMF-based model-fitting criterion defined in the spectral domain and a Gaussian mixture model (GMM)-based probability distribution defined in the cepstral domain.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech enhancement / Gaussian mixture model / non-negative matrix factorization / mel-frequency cepstral coefficients / majorization-minimization
Paper # SP2016-32
Date of Issue 2016-08-17 (SP)

Conference Information
Committee SP
Conference Date 2016/8/24(2days)
Place (in Japanese) (See Japanese page)
Place (in English) ACCMS, Kyoto Univ.
Topics (in Japanese) (See Japanese page)
Topics (in English) Audio event processing, etc.
Chair Kazunori Mano(Shibaura Inst. of Tech.)
Vice Chair Hiroki Mori(Utsunomiya Univ.)
Secretary Hiroki Mori(Kobe Univ.)
Assistant Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.)

Paper Information
Registration To Technical Committee on Speech
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) [Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech
Sub Title (in English)
Keyword(1) speech enhancement
Keyword(2) Gaussian mixture model
Keyword(3) non-negative matrix factorization
Keyword(4) mel-frequency cepstral coefficients
Keyword(5) majorization-minimization
1st Author's Name Li Li
1st Author's Affiliation University of Tsukuba(Univ.Tsukuba)
2nd Author's Name Hirokazu Kameoka
2nd Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
3rd Author's Name Takuya Higuchi
3rd Author's Affiliation Nippon Telegraph and Telephone Corporation(NTT)
4th Author's Name Hiroshi Saruwatari
4th Author's Affiliation University of Tokyo(Univ.Tokyo)
5th Author's Name Shoji Makino
5th Author's Affiliation University of Tsukuba(Univ.Tsukuba)
Date 2016-08-24
Paper # SP2016-32
Volume (vol) vol.116
Number (no) SP-189
Page pp.pp.29-32(SP),
#Pages 4
Date of Issue 2016-08-17 (SP)