講演名 2004/12/13
Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
,
PDFダウンロードページ PDFダウンロードページへ
抄録(和)
抄録(英) In this paper, we propose a distributed speaker recognition method using a non-parametric speaker model and Earth Mover's Distance (EMD). In distributed speaker recognition, the quantized feature vectors are sent to a server. The Gaussian mixture model (GMM), the traditional method used for speaker recognition, is trained using the maximum likelihood approach. However, it is difficult to fit continuous density functions to quantized data. To overcome this problem, the proposed method represents each speaker model with a speaker-dependent VQ code histogram designed by registered feature vectors and directly calculates the distance between the histograms of speaker models and testing quantized feature vectors. To measure the distance between each speaker model and testing data, we use EMD which can calculate the distance between histograms with different bins. We conducted text-independent speaker identification experiments using the proposed method. Compared to results using the traditional GMM, the proposed method yielded relative error reductions of 32% for quantized data.
キーワード(和)
キーワード(英) Distributed Speaker Recognition / speaker identification / non-parametric / Earth Mover's Distance
資料番号 NLG2004-55,SP2004-95
発行日

研究会情報
研究会 SP
開催期間 2004/12/13(から1日開催)
開催地(和)
開催地(英)
テーマ(和)
テーマ(英)
委員長氏名(和)
委員長氏名(英)
副委員長氏名(和)
副委員長氏名(英)
幹事氏名(和)
幹事氏名(英)
幹事補佐氏名(和)
幹事補佐氏名(英)

講演論文情報詳細
申込み研究会 Speech (SP)
本文の言語 ENG
タイトル(和)
サブタイトル(和)
タイトル(英) Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
サブタイトル(和)
キーワード(1)(和/英) / Distributed Speaker Recognition
第 1 著者 氏名(和/英) / Yoshiyuki UMEDA
第 1 著者 所属(和/英)
Faculty of Engineering, Tokushima University
発表年月日 2004/12/13
資料番号 NLG2004-55,SP2004-95
巻番号(vol) vol.104
号番号(no) 541
ページ範囲 pp.-
ページ数 6
発行日