残響環境下でのロバストで正確なF0推定法の比較評価(福祉と知能・情動・認知障害,福祉と音声処理,一般)

Presentation	2008-01-25 Comparative evaluations of robust and accurate F0 estimates in reverberant environments Masashi UNOKI, Toshihiro HOSOROGIYA, Yuichi ISHIMOTO,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper reports comparative evaluations of the method we previously proposed of estimating fundamental frequency (F_0) based on complex cepstrum analysis with nine typical methods over huge speech-sound datasets in both artificial and realistic reverberant environments (in room coustics). They involve several classic algorithms (Cepstrum, AMDF, LPC, and modified autocorrelation) and a few modern algorithms (TEMPO, YIN, and PHIA). The comparative results revealed that the percentage correct rates of the estimated F_0s using them were drastically reduced as the reverberation time increased while F_0 estimated with the proposed method was completely robust and accurate. They also demonstrated that homomorphic analysis and the concept of a source-filter model were relatively effective for estimating F_0. The results also demonstrated that it was much better than the previously reported methods in terms of robustness and providing accurate F_0 estimates in both artificial and realistic reverberant environments.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	F0 estimation / reverberant speech / complex cepstrum analysis / MTF concept / source-filter model
Paper #	TL2007-73,SP2007-168,WIT2007-73
Date of Issue

Paper Information
Registration To	Thought and Language (TL)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Comparative evaluations of robust and accurate F0 estimates in reverberant environments
Sub Title (in English)
Keyword(1)	F0 estimation
Keyword(2)	reverberant speech
Keyword(3)	complex cepstrum analysis
Keyword(4)	MTF concept
Keyword(5)	source-filter model
1st Author's Name	Masashi UNOKI
1st Author's Affiliation	School of Information Science, Japan Advanced Institute of Science and Technology()
2nd Author's Name	Toshihiro HOSOROGIYA
2nd Author's Affiliation	School of Information Science, Japan Advanced Institute of Science and Technology
3rd Author's Name	Yuichi ISHIMOTO
3rd Author's Affiliation	School of Media Science, Tokyo University of Technology
Date	2008-01-25
Paper #	TL2007-73,SP2007-168,WIT2007-73
Volume (vol)	vol.107
Number (no)	433
Page	pp.pp.-
#Pages	6
Date of Issue