Presentation 2015-03-03
Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Pheneme Recognition Performance(Poster Presentation)
Risa KOIZUMI, Kazuyuki TAKAGI,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In most of current speech processing techniques, MFCC (Mel-Frequency Cepstrum Coefficients) obtained from amplitude spectrum and AMFCC calculated as time derivative of MFCC are widely used as acoustic features. However, these features consider neither frequency derivative of amplitude spectrum nor phase information of speech waveform. Local feature and group delay spectrum are among the features claimed by previous works to possess such information useful for speech processing. We therefore examine their effectiveness on speech recognition performance. We conducted phoneme recognition experiments using speaker-dependent (10 males, 10 females) phoneme HMMs trained with local feature, group delay spectrum, and MFCC, in same speaker, same gender, and different gender conditions. We obtained highest recognition rate by local feature, while the other features showed better performance for some phonemes. Likelihood combination of local feature, group delay spectrum, and MFCC HMMs yielded better phoneme recognition rate than the case in which each HMM was used solely. Results show that it is promising that recognition performance degradation can be alleviated by a combination of local feature, group delay spectrum, and MFCC.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) local feature / group delay spectrum / MFCC / likelihood combination / phoneme recognition
Paper # EA2014-108,SIP2014-149,SP2014-171
Date of Issue

Conference Information
Committee SIP
Conference Date 2015/2/23(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Signal Processing (SIP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Pheneme Recognition Performance(Poster Presentation)
Sub Title (in English)
Keyword(1) local feature
Keyword(2) group delay spectrum
Keyword(3) MFCC
Keyword(4) likelihood combination
Keyword(5) phoneme recognition
1st Author's Name Risa KOIZUMI
1st Author's Affiliation The University of Electro-Communications()
2nd Author's Name Kazuyuki TAKAGI
2nd Author's Affiliation The University of Electro-Communications
Date 2015-03-03
Paper # EA2014-108,SIP2014-149,SP2014-171
Volume (vol) vol.114
Number (no) 474
Page pp.pp.-
#Pages 4
Date of Issue