雑音に頑健な基本周波数抽出法とその音声認識への適用

Presentation	2002/4/19 Robust F_0 Extraction for Noisy Environments and Its Use for Speech Recognition Koji IWANO, Takahiro SEKI, Sadaoki FURUI,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper proposes a noise robust speech recognition method using prosodic information. In Japanese, fundamental frequency (F_0) contour represents phrase intonation and word accent information. Consequently, it conveys information about prosodic phrase and word boundaries. We have developed a robust F_0 extraction method using Hough transform, which yields high extracting rate under various noise conditions. In this paper, we propose a noise robust speech recognition method using syllable HMMs which model both segmental spectral features and F_0 contour information. Speaker-independent experiments are conducted using connected digits uttered by 11 male speakers in various kinds of noise and SNR conditions. The recognition accuracy is improved in all noise conditions, and the best absolute improvement of digit accuracy is about 4.7%. This improvement is achieved due to the more precise digit boundary detection by the robust prosodic information.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Noise robust speech recognition / Prosodic information / Fundamental frequency (F_0) contour
Paper #	SP2002-13
Date of Issue

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Robust F_0 Extraction for Noisy Environments and Its Use for Speech Recognition
Sub Title (in English)
Keyword(1)	Noise robust speech recognition
Keyword(2)	Prosodic information
Keyword(3)	Fundamental frequency (F_0) contour
1st Author's Name	Koji IWANO
1st Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology()
2nd Author's Name	Takahiro SEKI
2nd Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology
3rd Author's Name	Sadaoki FURUI
3rd Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology
Date	2002/4/19
Paper #	SP2002-13
Volume (vol)	vol.102
Number (no)	35
Page	pp.pp.-
#Pages	6
Date of Issue