Presentation | 2004/12/13 Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance. Yoshiyuki UMEDA, Satoru TSUGE, Fuji REN, Shingo KUROIWA |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we propose a distributed speaker recognition method that uses a non-parametric speaker model and the Earth Mover's Distance (EMD). In distributed speaker recognition, quantized feature vectors are sent to a server. The Gaussian mixture model (GMM), the traditional model for speaker recognition, is trained with the maximum likelihood approach. However, it is difficult to fit continuous density functions to quantized data. To overcome this problem, the proposed method represents each speaker model as a speaker-dependent VQ code histogram built from the registered feature vectors, and directly calculates the distance between the speaker-model histograms and the histogram of the quantized test feature vectors. To measure the distance between each speaker model and the test data, we use EMD, which can calculate the distance between histograms with different bins. We conducted text-independent speaker identification experiments using the proposed method. Compared with the traditional GMM, the proposed method yielded a relative error reduction of 32% on quantized data. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Distributed Speaker Recognition / speaker identification / non-parametric / Earth Mover's Distance |
Paper # | NLC2004-55,SP2004-95 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2004/12/13 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance |
Sub Title (in English) | |
Keyword(1) | Distributed Speaker Recognition |
Keyword(2) | speaker identification |
Keyword(3) | non-parametric |
Keyword(4) | Earth Mover's Distance |
1st Author's Name | Yoshiyuki UMEDA |
1st Author's Affiliation | Faculty of Engineering, Tokushima University |
2nd Author's Name | Satoru TSUGE |
2nd Author's Affiliation | Faculty of Engineering, Tokushima University |
3rd Author's Name | Fuji REN |
3rd Author's Affiliation | Faculty of Engineering, Tokushima University |
4th Author's Name | Shingo KUROIWA |
4th Author's Affiliation | Faculty of Engineering, Tokushima University |
Date | 2004/12/13 |
Paper # | NLC2004-55,SP2004-95 |
Volume (vol) | vol.104 |
Number (no) | 538 |
Page | pp.- |
#Pages | 6 |
Date of Issue |
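
The abstract describes comparing VQ code histograms whose bins differ from speaker to speaker. The following is a minimal sketch of that idea, not the authors' implementation: it treats each histogram as a list of codebook centroids with weights and computes EMD as a small transportation problem solved by successive shortest paths. The function names (`ground_costs`, `emd`, `identify`), the Euclidean ground distance, and the toy data are all assumptions made for illustration; the abstract does not specify the codebook size, features, or ground distance.

```python
from fractions import Fraction
import math


def ground_costs(centroids_p, centroids_q):
    """Assumed ground distance: Euclidean distance between codebook centroids."""
    return [[math.dist(a, b) for b in centroids_q] for a in centroids_p]


def emd(weights_p, weights_q, cost):
    """EMD between two histograms whose bins may differ.

    weights_p[i] / weights_q[j] are bin masses; cost[i][j] is the ground
    distance between bin i of p and bin j of q. Masses are held as Fractions
    so the flow bookkeeping is exact.
    """
    m, n = len(weights_p), len(weights_q)
    size, src, snk = m + n + 2, m + n, m + n + 1
    edges, adj = [], [[] for _ in range(size)]  # edge k ^ 1 is the reverse of edge k

    def add_edge(u, v, cap, c):
        adj[u].append(len(edges)); edges.append([v, cap, c])
        adj[v].append(len(edges)); edges.append([u, Fraction(0), -c])

    total_p = sum(Fraction(w) for w in weights_p)
    total_q = sum(Fraction(w) for w in weights_q)
    target = min(total_p, total_q)  # EMD moves the smaller total mass
    if target <= 0:
        raise ValueError("both histograms need positive mass")
    for i, w in enumerate(weights_p):
        add_edge(src, i, Fraction(w), 0.0)
    for j, w in enumerate(weights_q):
        add_edge(m + j, snk, Fraction(w), 0.0)
    for i in range(m):
        for j in range(n):
            add_edge(i, m + j, total_p + total_q, float(cost[i][j]))

    flow, work = Fraction(0), 0.0
    while flow < target:
        # Bellman-Ford shortest path in the residual graph.
        dist, prev = [math.inf] * size, [-1] * size
        dist[src] = 0.0
        for _ in range(size - 1):
            changed = False
            for u in range(size):
                if dist[u] == math.inf:
                    continue
                for k in adj[u]:
                    v, cap, c = edges[k]
                    if cap > 0 and dist[u] + c < dist[v] - 1e-12:
                        dist[v], prev[v], changed = dist[u] + c, k, True
            if not changed:
                break
        if dist[snk] == math.inf:
            break
        # Find the bottleneck along the path, then push that much flow.
        push, v = target - flow, snk
        while v != src:
            push = min(push, edges[prev[v]][1])
            v = edges[prev[v] ^ 1][0]
        v = snk
        while v != src:
            k = prev[v]
            edges[k][1] -= push
            edges[k ^ 1][1] += push
            v = edges[k ^ 1][0]
        flow += push
        work += float(push) * dist[snk]
    return work / float(flow)  # total work normalized by total flow


def identify(test_sig, speaker_sigs):
    """Return the speaker whose histogram has the smallest EMD to the test data."""
    test_centroids, test_weights = test_sig
    scores = {
        name: emd(w, test_weights, ground_costs(c, test_centroids))
        for name, (c, w) in speaker_sigs.items()
    }
    return min(scores, key=scores.get)
```

In this sketch a signature is a `(centroids, weights)` pair, for example `([(0.0, 0.0), (1.0, 0.0)], [3, 1])`, and `identify` picks the enrolled speaker with the smallest distance to the test histogram. A production system would more likely use a dedicated linear-programming or optimal-transport solver than this small flow routine.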