Paper Abstract and Keywords |
Presentation |
2015-03-03 10:45
[Poster Presentation]
Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Phoneme Recognition Performance
Risa Koizumi, Kazuyuki Takagi (UEC)
EA2014-108 SIP2014-149 SP2014-171 |
Abstract |
(in English) |
Most current speech processing techniques use MFCC (Mel-Frequency Cepstral Coefficients), obtained from the amplitude spectrum, and $\Delta$MFCC, calculated as the time derivative of MFCC, as acoustic features. However, these features capture neither the frequency derivative of the amplitude spectrum nor the phase information of the speech waveform. The local feature and the group delay spectrum are among the features that previous work claims carry such information and are useful for speech processing. We therefore examine their effectiveness for speech recognition. We conducted phoneme recognition experiments using speaker-dependent phoneme HMMs (10 male and 10 female speakers) trained with the local feature, the group delay spectrum, and MFCC, under same-speaker, same-gender, and different-gender conditions. The local feature gave the highest recognition rate, while the other features performed better for some phonemes. Likelihood combination of the local feature, group delay spectrum, and MFCC HMMs yielded a better phoneme recognition rate than any of the HMMs used alone. The results suggest that combining the local feature, group delay spectrum, and MFCC is a promising way to alleviate degradation in recognition performance. |
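The abstract does not say how the likelihoods of the three HMM sets are combined. A common choice, assumed here purely for illustration, is a weighted sum of per-phoneme log-likelihoods followed by an argmax. The function names, stream names, equal default weights, and toy scores below are all hypothetical and are not taken from the paper.

```python
# Minimal sketch of likelihood combination across feature streams.
# Assumption: each stream's HMM set has already produced one log-likelihood
# per candidate phoneme for the segment being recognized.

def combine_log_likelihoods(stream_scores, weights=None):
    """Return the weighted sum of per-phoneme log-likelihoods over all streams.

    stream_scores: dict mapping stream name -> list of log-likelihoods,
                   one per candidate phoneme, in the same order for every stream.
    weights:       dict mapping stream name -> weight (equal weights if None).
    """
    streams = list(stream_scores)
    if weights is None:
        weights = {s: 1.0 / len(streams) for s in streams}
    n_candidates = len(stream_scores[streams[0]])
    return [
        sum(weights[s] * stream_scores[s][k] for s in streams)
        for k in range(n_candidates)
    ]


def recognize(phonemes, stream_scores, weights=None):
    """Pick the phoneme whose combined log-likelihood is highest."""
    combined = combine_log_likelihoods(stream_scores, weights)
    best = max(range(len(phonemes)), key=lambda k: combined[k])
    return phonemes[best]


# Toy numbers, invented for illustration only (not results from the paper).
PHONEMES = ["a", "i", "u"]
TOY_SCORES = {
    "local_feature": [-10.0, -12.0, -15.0],
    "group_delay": [-14.0, -11.0, -13.0],
    "mfcc": [-13.0, -11.5, -16.0],
}

if __name__ == "__main__":
    only_local = {"local_feature": 1.0, "group_delay": 0.0, "mfcc": 0.0}
    print(recognize(PHONEMES, TOY_SCORES, only_local))  # local feature alone
    print(recognize(PHONEMES, TOY_SCORES))              # equal-weight combination
```

With these toy scores the local-feature stream alone prefers one candidate, while the equal-weight combination prefers another, which is the kind of complementary behavior the abstract describes. In practice the stream weights would be tuned on held-out data.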
Keyword |
(in English) |
local feature / group delay spectrum / MFCC / likelihood combination / phoneme recognition |
Reference Info. |
IEICE Tech. Rep., vol. 114, no. 475, SP2014-171, pp. 197-200, March 2015. |
Paper # |
SP2014-171 |
Date of Issue |
2015-02-23 (EA, SIP, SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
SIP EA SP |
Conference Date |
2015-03-02 - 2015-03-03 |
Paper Information |
Registration To |
SP |
Conference Code |
2015-03-SIP-EA-SP |
Language |
Japanese |
Title (in English) |
Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Phoneme Recognition Performance |
Keyword(1) |
local feature |
Keyword(2) |
group delay spectrum |
Keyword(3) |
MFCC |
Keyword(4) |
likelihood combination |
Keyword(5) |
phoneme recognition |
1st Author's Name |
Risa Koizumi |
1st Author's Affiliation |
The University of Electro-Communications (UEC) |
2nd Author's Name |
Kazuyuki Takagi |
2nd Author's Affiliation |
The University of Electro-Communications (UEC) |
Speaker |
Author-1 |
Date Time |
2015-03-03 10:45:00 |
Presentation Time |
90 minutes |
Registration for |
SP |
Paper # |
EA2014-108, SIP2014-149, SP2014-171 |
Volume (vol) |
vol.114 |
Number (no) |
no.473(EA), no.474(SIP), no.475(SP) |
Page |
pp.197-200 |
#Pages |
4 |
|