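A Study on Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction (Session-8 Poster Session: General, The 7th Spoken Language Symposium)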

Presentation	2005/12/15 A Study on Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction Koichi Yamamoto, Jabloun Firas, Klaus Reinhard, Akinori Kawamura
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Accurate endpoint detection is important for improving speech recognition performance. This paper proposes a novel endpoint detection method that combines energy-based and likelihood-ratio-based voice activity detection (VAD) criteria, where the likelihood ratio is calculated with speech and non-speech Gaussian mixture models (GMMs). The proposed method also introduces discriminative feature extraction (DFE) to improve speech/non-speech classification; DFE is used to train the parameters required for calculating the likelihood ratio. Our experimental evaluation showed that the proposed method reduces the recognition error rate compared with a conventional energy-based technique.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Endpoint detection / VAD / DFE / GMM
Paper #	NLC2005-93,SP2005-126
Date of Issue

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	A Study on Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction
Sub Title (in English)
Keyword(1)	Endpoint detection
Keyword(2)	VAD
Keyword(3)	DFE
Keyword(4)	GMM
1st Author's Name	Koichi Yamamoto
1st Author's Affiliation	Multimedia Laboratory, Corporate R&D Center, Toshiba Corp.
2nd Author's Name	Jabloun Firas
2nd Author's Affiliation	Speech Technology Group, Cambridge Research Laboratory, Toshiba Research Europe Ltd.
3rd Author's Name	Klaus Reinhard
3rd Author's Affiliation	Speech Technology Group, Cambridge Research Laboratory, Toshiba Research Europe Ltd.
4th Author's Name	Akinori Kawamura
4th Author's Affiliation	Multimedia Laboratory, Corporate R&D Center, Toshiba Corp.
Date	2005/12/15
Volume (vol)	vol.105
Number (no)	494
Page	pp.-
#Pages	6