Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance

Presentation	2004/12/13 Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance Yoshiyuki UMEDA, Satoru TSUGE, Fuji REN, Shingo KUROIWA
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, we propose a distributed speaker recognition method that uses a non-parametric speaker model and the Earth Mover's Distance (EMD). In distributed speaker recognition, quantized feature vectors are sent to a server. The Gaussian mixture model (GMM), the traditional speaker model, is trained by maximum likelihood estimation, but continuous density functions are difficult to fit to quantized data. To overcome this problem, the proposed method represents each speaker model as a speaker-dependent VQ code histogram built from the registered feature vectors, and directly calculates the distance between the speaker-model histograms and the histogram of the quantized test feature vectors. For this distance we use EMD, which can compare histograms that have different bins. We conducted text-independent speaker identification experiments with the proposed method. Compared with the traditional GMM, the proposed method yielded a relative error reduction of 32% on quantized data.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Distributed Speaker Recognition / speaker identification / non-parametric / Earth Mover's Distance
Paper #	NLC2004-55,SP2004-95
Date of Issue
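
The matching step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names (`vq_histogram`, `emd`, `identify`), the Euclidean ground distance, and the use of NumPy and SciPy's `linprog` to solve the transportation problem are all assumptions made for the example. Each histogram is a set of bin weights together with the codeword (bin center) of each bin, so the speaker model and the test data may use different codebooks.

```python
import numpy as np
from scipy.optimize import linprog


def vq_histogram(features, codebook):
    """Quantize each feature vector to its nearest codeword; return normalized code counts."""
    # Squared Euclidean distance between every feature vector and every codeword.
    d2 = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    codes = d2.argmin(axis=1)
    counts = np.bincount(codes, minlength=len(codebook)).astype(float)
    return counts / counts.sum()


def emd(weights_a, centers_a, weights_b, centers_b):
    """EMD between two normalized histograms whose bins sit at different centers."""
    n, m = len(weights_a), len(weights_b)
    # Ground distance (assumed Euclidean) between bin centers, shape (n, m).
    ground = np.linalg.norm(centers_a[:, None, :] - centers_b[None, :, :], axis=2)
    # Flow variables f[i, j] are flattened row-major into a vector of length n * m.
    a_eq = np.zeros((n + m, n * m))
    for i in range(n):
        a_eq[i, i * m:(i + 1) * m] = 1.0  # bin i of A ships out weights_a[i]
    for j in range(m):
        a_eq[n + j, j::m] = 1.0           # bin j of B receives weights_b[j]
    b_eq = np.concatenate([weights_a, weights_b])
    res = linprog(ground.ravel(), A_eq=a_eq, b_eq=b_eq,
                  bounds=(0, None), method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    # Both histograms sum to 1, so the total flow is 1 and the cost is the EMD.
    return res.fun


def identify(test_weights, test_centers, speaker_models):
    """Return the speaker whose model histogram is closest to the test histogram.

    speaker_models is a hypothetical dict: name -> (weights, centers).
    """
    return min(
        speaker_models,
        key=lambda name: emd(*speaker_models[name], test_weights, test_centers),
    )
```

In this sketch a speaker would be enrolled by calling `vq_histogram` with that speaker's own codebook, and a test utterance would arrive as a histogram over the codebook used for transmission. Because `emd` only needs the bin centers and weights of each side, the two codebooks do not have to match.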

Conference Information
Committee	NLC
Conference Date	2004/12/13 (1 day)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
Sub Title (in English)
Keyword(1)	Distributed Speaker Recognition
Keyword(2)	speaker identification
Keyword(3)	non-parametric
Keyword(4)	Earth Mover's Distance
1st Author's Name	Yoshiyuki UMEDA
1st Author's Affiliation	Faculty of Engineering, Tokushima University
2nd Author's Name	Satoru TSUGE
2nd Author's Affiliation	Faculty of Engineering, Tokushima University
3rd Author's Name	Fuji REN
3rd Author's Affiliation	Faculty of Engineering, Tokushima University
4th Author's Name	Shingo KUROIWA
4th Author's Affiliation	Faculty of Engineering, Tokushima University
Date	2004/12/13
Paper #	NLC2004-55,SP2004-95
Volume (vol)	vol.104
Number (no)	538
Page	pp.-
#Pages	6
Date of Issue