Presentation | 2015-03-03 Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Phoneme Recognition Performance (Poster Presentation) Risa KOIZUMI, Kazuyuki TAKAGI |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In most current speech processing techniques, MFCC (Mel-Frequency Cepstrum Coefficients), obtained from the amplitude spectrum, and ΔMFCC, calculated as the time derivative of MFCC, are widely used as acoustic features. However, these features consider neither the frequency derivative of the amplitude spectrum nor the phase information of the speech waveform. Previous work claims that the local feature and the group delay spectrum carry such information and that it is useful for speech processing. We therefore examine their effectiveness on speech recognition performance. We conducted phoneme recognition experiments using speaker-dependent (10 males, 10 females) phoneme HMMs trained with the local feature, group delay spectrum, and MFCC, under same-speaker, same-gender, and different-gender conditions. The local feature gave the highest recognition rate, while the other features performed better for some phonemes. Combining the likelihoods of the local feature, group delay spectrum, and MFCC HMMs yielded a better phoneme recognition rate than using any single HMM alone. The results suggest that combining the local feature, group delay spectrum, and MFCC is a promising way to alleviate degradation in recognition performance. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | local feature / group delay spectrum / MFCC / likelihood combination / phoneme recognition |
Paper # | EA2014-108,SIP2014-149,SP2014-171 |
Date of Issue |
Conference Information | |
Committee | SIP |
---|---|
Conference Date | 2015/2/23 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Signal Processing (SIP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Phoneme Recognition Performance (Poster Presentation) |
Sub Title (in English) | |
Keyword(1) | local feature |
Keyword(2) | group delay spectrum |
Keyword(3) | MFCC |
Keyword(4) | likelihood combination |
Keyword(5) | phoneme recognition |
1st Author's Name | Risa KOIZUMI |
1st Author's Affiliation | The University of Electro-Communications |
2nd Author's Name | Kazuyuki TAKAGI |
2nd Author's Affiliation | The University of Electro-Communications |
Date | 2015-03-03 |
Paper # | EA2014-108,SIP2014-149,SP2014-171 |
Volume (vol) | vol.114 |
Number (no) | 474 |
Page | pp.- |
#Pages | 4 |
Date of Issue |