Presentation 2004/12/13
Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
Yoshiyuki UMEDA, Satoru TSUGE, Fuji REN, Shingo KUROIWA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we propose a distributed speaker recognition method using a non-parametric speaker model and Earth Mover's Distance (EMD). In distributed speaker recognition, the quantized feature vectors are sent to a server. The Gaussian mixture model (GMM), the traditional method used for speaker recognition, is trained using the maximum likelihood approach. However, it is difficult to fit continuous density functions to quantized data. To overcome this problem, the proposed method represents each speaker model with a speaker-dependent VQ code histogram designed by registered feature vectors and directly calculates the distance between the histograms of speaker models and testing quantized feature vectors. To measure the distance between each speaker model and testing data, we use EMD which can calculate the distance between histograms with different bins. We conducted text-independent speaker identification experiments using the proposed method. Compared to results using the traditional GMM, the proposed method yielded relative error reductions of 32% for quantized data.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Distributed Speaker Recognition / speaker identification / non-parametric / Earth Mover's Distance
Paper # NLG2004-55,SP2004-95
Date of Issue

Conference Information
Committee NLC
Conference Date 2004/12/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
Sub Title (in English)
Keyword(1) Distributed Speaker Recognition
Keyword(2) speaker identification
Keyword(3) non-parametric
Keyword(4) Earth Mover's Distance
1st Author's Name Yoshiyuki UMEDA
1st Author's Affiliation Faculty of Engineering, Tokushima University()
2nd Author's Name Satoru TSUGE
2nd Author's Affiliation Faculty of Engineering, Tokushima University
3rd Author's Name Fuji REN
3rd Author's Affiliation Faculty of Engineering, Tokushima University
4th Author's Name Shingo KUROIWA
4th Author's Affiliation Faculty of Engineering, Tokushima University
Date 2004/12/13
Paper # NLG2004-55,SP2004-95
Volume (vol) vol.104
Number (no) 538
Page pp.pp.-
#Pages 6
Date of Issue