Presentation | 2016-08-24 [Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech Li Li, Hirokazu Kameoka, Takuya Higuchi, Hiroshi Saruwatari, Shoji Makino, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | While spectral domain speech enhancement algorithms using non-negative matrix factorization (NMF) are powerful in terms of signal recovery accuracy (e.g., signal-to-noise ratio), they do not necessarily lead to an improvement in the quality of the enhanced speech in the feature domain. This implies that naively using these algorithms as front-end processing for e.g., speech recognition and speech conversion does not always lead to satisfactory results. To address this problem, this paper proposes a novel method that aims to jointly enhance the spectral and cepstral sequences of noisy speech, by optimizing a combined objective function consisting of an NMF-based model-fitting criterion defined in the spectral domain and a Gaussian mixture model (GMM)-based probability distribution defined in the cepstral domain. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech enhancement / Gaussian mixture model / non-negative matrix factorization / mel-frequency cepstral coefficients / majorization-minimization |
Paper # | SP2016-32 |
Date of Issue | 2016-08-17 (SP) |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2016/8/24(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | ACCMS, Kyoto Univ. |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Audio event processing, etc. |
Chair | Kazunori Mano(Shibaura Inst. of Tech.) |
Vice Chair | Hiroki Mori(Utsunomiya Univ.) |
Secretary | Hiroki Mori(Kobe Univ.) |
Assistant | Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) |
Paper Information | |
Registration To | Technical Committee on Speech |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech |
Sub Title (in English) | |
Keyword(1) | speech enhancement |
Keyword(2) | Gaussian mixture model |
Keyword(3) | non-negative matrix factorization |
Keyword(4) | mel-frequency cepstral coefficients |
Keyword(5) | majorization-minimization |
1st Author's Name | Li Li |
1st Author's Affiliation | University of Tsukuba(Univ.Tsukuba) |
2nd Author's Name | Hirokazu Kameoka |
2nd Author's Affiliation | Nippon Telegraph and Telephone Corporation(NTT) |
3rd Author's Name | Takuya Higuchi |
3rd Author's Affiliation | Nippon Telegraph and Telephone Corporation(NTT) |
4th Author's Name | Hiroshi Saruwatari |
4th Author's Affiliation | University of Tokyo(Univ.Tokyo) |
5th Author's Name | Shoji Makino |
5th Author's Affiliation | University of Tsukuba(Univ.Tsukuba) |
Date | 2016-08-24 |
Paper # | SP2016-32 |
Volume (vol) | vol.116 |
Number (no) | SP-189 |
Page | pp.pp.29-32(SP), |
#Pages | 4 |
Date of Issue | 2016-08-17 (SP) |