Presentation 2009-05-29
Robust estimation of fundamental frequencies of speech signals in noisy environments based on signal filtering by empirical mode decomposition
Tetsuya MATSUDA, Keikichi HIROSE, Nobuaki MINEMATSU
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Many methods have been developed for estimating the fundamental frequency of speech signals. However, almost all of them observe the signal over a short time span and assume that it is linear and stationary within that span, which is not strictly correct. This assumption limits the precision of the estimation. Empirical mode decomposition (EMD) is a signal analysis method that does not assume linearity or stationarity, but it is difficult to apply to pitch estimation because the components corresponding to the fundamental frequency may be spread across several of the decomposed functions. This paper introduces a new EMD-based pitch estimation method that solves this problem by applying EMD to the autocorrelation function of the signal. Since, in the lag-time domain, the signal energy concentrates at frequencies corresponding to the signal structure, fundamental frequencies can be estimated robustly by properly selecting one of the decomposed functions. Experiments on pitch estimation in noisy environments showed the robustness of the proposed method.
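As a rough illustration of the lag-time domain the abstract refers to, the sketch below computes a normalized autocorrelation function and picks its strongest peak within a plausible pitch range. This is only the conventional autocorrelation baseline, not the authors' method: the EMD decomposition and the rule for selecting a decomposed function are not described in the abstract and are not reproduced here. The function names, the 60 to 400 Hz search range, and the synthetic test tone are all assumptions made for illustration.

```python
import math


def autocorrelation(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of x, normalized so that r[0] == 1."""
    n = len(x)
    mean = sum(x) / n
    centered = [v - mean for v in x]
    r0 = sum(v * v for v in centered)
    return [
        sum(centered[i] * centered[i + k] for i in range(n - k)) / r0
        for k in range(max_lag + 1)
    ]


def estimate_f0(x, fs, fmin=60.0, fmax=400.0):
    """Return an F0 estimate in Hz from the largest autocorrelation peak.

    fmin and fmax bound the search range (assumed values, not from the paper).
    """
    lag_min = int(fs / fmax)  # shortest period considered, in samples
    lag_max = int(fs / fmin)  # longest period considered, in samples
    r = autocorrelation(x, lag_max)
    best_lag = max(range(lag_min, lag_max + 1), key=lambda k: r[k])
    return fs / best_lag


# Synthetic example: a 200 Hz tone sampled at 8 kHz (100 ms long).
fs = 8000
tone = [math.sin(2 * math.pi * 200 * i / fs) for i in range(800)]
f0 = estimate_f0(tone, fs)
print(f0)
```

In the approach the abstract outlines, the autocorrelation sequence `r` would be passed to EMD and one of the resulting functions would be used for the period estimate, rather than taking the raw peak as done here.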
Keyword(in Japanese) (See Japanese page)
Keyword(in English) pitch estimation / noisy environment / empirical mode decomposition / autocorrelation
Paper # EA2009-10,SIP2009-10,SP2009-15
Date of Issue

Conference Information
Committee SP
Conference Date 2009/5/21 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Robust estimation of fundamental frequencies of speech signals in noisy environments based on signal filtering by empirical mode decomposition
Sub Title (in English)
Keyword(1) pitch estimation
Keyword(2) noisy environment
Keyword(3) empirical mode decomposition
Keyword(4) autocorrelation
1st Author's Name Tetsuya MATSUDA
1st Author's Affiliation Graduate School of Information Science and Technology, The University of Tokyo
2nd Author's Name Keikichi HIROSE
2nd Author's Affiliation Graduate School of Information Science and Technology, The University of Tokyo
3rd Author's Name Nobuaki MINEMATSU
3rd Author's Affiliation Graduate School of Information Science and Technology, The University of Tokyo
Date 2009-05-29
Paper # EA2009-10,SIP2009-10,SP2009-15
Volume (vol) vol.109
Number (no) 57
Page pp.-
#Pages 6