Presentation 2007/12/13
A study and an examination on adaptive integration of multiple voice activity detection
Masakiyo FUJIMOTO, Kentaro ISHIZUKA, Tomohiro NAKATANI,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The VAD method proposed in this paper integrates multiple speech features and a signal decision scheme, namely the speech periodic to aperiodic component ratio and a switching Kalman filter. The integration is carried out by using the weighted sum of likelihoods outputted from each VAD (stream). The stream weight is decided adaptively each short time frame. The evaluation is carried out by using a VAD evaluation framework, CENSREC-1-C. The evaluation results revealed that the proposed method significantly outperforms the baseline results of CENSREC-1-C as regards VAD accuracy in real environments. In addition, we examine the method of likelihoods weighting through the experiments.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) voice activity detection / periodic to aperiodic component ratio / switching Kalman filter / adaptive integration
Paper # NLC2007-34,SP2007-97
Date of Issue

Conference Information
Committee NLC
Conference Date 2007/12/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A study and an examination on adaptive integration of multiple voice activity detection
Sub Title (in English)
Keyword(1) voice activity detection
Keyword(2) periodic to aperiodic component ratio
Keyword(3) switching Kalman filter
Keyword(4) adaptive integration
1st Author's Name Masakiyo FUJIMOTO
1st Author's Affiliation NTT Communicaition Science Laboratories, NTT Corp.()
2nd Author's Name Kentaro ISHIZUKA
2nd Author's Affiliation NTT Communicaition Science Laboratories, NTT Corp.
3rd Author's Name Tomohiro NAKATANI
3rd Author's Affiliation NTT Communicaition Science Laboratories, NTT Corp.
Date 2007/12/13
Paper # NLC2007-34,SP2007-97
Volume (vol) vol.107
Number (no) 405
Page pp.pp.-
#Pages 6
Date of Issue