Presentation 2008-01-25
Comparative evaluations of robust and accurate F0 estimates in reverberant environments
Masashi UNOKI, Toshihiro HOSOROGIYA, Yuichi ISHIMOTO
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper reports comparative evaluations of our previously proposed method of estimating fundamental frequency (F_0), which is based on complex cepstrum analysis, against nine typical methods over large speech-sound datasets in both artificial and realistic reverberant environments (room acoustics). The compared methods include several classic algorithms (Cepstrum, AMDF, LPC, and modified autocorrelation) and a few modern algorithms (TEMPO, YIN, and PHIA). The comparative results revealed that the percentage correct rates of the F_0s estimated with these methods were drastically reduced as the reverberation time increased, while the F_0 estimated with the proposed method remained robust and accurate. They also demonstrated that homomorphic analysis and the concept of a source-filter model were relatively effective for estimating F_0. Overall, the proposed method was much better than the previously reported methods in terms of robustness and accuracy of F_0 estimates in both artificial and realistic reverberant environments.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) F0 estimation / reverberant speech / complex cepstrum analysis / MTF concept / source-filter model
Paper # TL2007-73,SP2007-168,WIT2007-73
Date of Issue

Conference Information
Committee TL
Conference Date 2008/1/18 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Thought and Language (TL)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Comparative evaluations of robust and accurate F0 estimates in reverberant environments
Sub Title (in English)
Keyword(1) F0 estimation
Keyword(2) reverberant speech
Keyword(3) complex cepstrum analysis
Keyword(4) MTF concept
Keyword(5) source-filter model
1st Author's Name Masashi UNOKI
1st Author's Affiliation School of Information Science, Japan Advanced Institute of Science and Technology
2nd Author's Name Toshihiro HOSOROGIYA
2nd Author's Affiliation School of Information Science, Japan Advanced Institute of Science and Technology
3rd Author's Name Yuichi ISHIMOTO
3rd Author's Affiliation School of Media Science, Tokyo University of Technology
Date 2008-01-25
Paper # TL2007-73,SP2007-168,WIT2007-73
Volume (vol) vol.107
Number (no) 433
Page pp.-
#Pages 6
Date of Issue