Speaker Recognition without Feature Extraction Process

講演名	2004/12/13 Speaker Recognition without Feature Extraction Process ,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)
抄録(英)	By employing the dual Penalized Logistic Regression Machine (dPLRM), this paper explores a speaker identification method which does not require feature extraction process depending on a prior knowledge. The induction machine can discover implicitly speaker characteristics relevant to discrimination only from a set of training data by the mechanism of the kernel regression. Our text-independent speaker identification experiments with training data uttered by 1 0 male speakers in three different sessions show that the proposed method is competitive with the conventional Gaussian mixture model (GMM)-based method with 26-dimensional Mel-frequency cepstrum (MFCC) feature even though our method handle directly coarse data of 256-dimensional log-power spectrum. It is also shown that our method outperforms the GMM-based method especially as the amount of training data becomes smaller.
キーワード(和)
キーワード(英)	Speaker recognition / Speaker identification / Kernel regression / dual Penalized Logistic Regression Machine / implicit feature extraction
資料番号	NLC2004-54,SP2004-94
発行日

講演論文情報詳細
申込み研究会	Natural Language Understanding and Models of Communication (NLC)
本文の言語	ENG
タイトル（和）
サブタイトル（和）
タイトル（英）	Speaker Recognition without Feature Extraction Process
サブタイトル（和）
キーワード(1)（和/英）	/ Speaker recognition
第 1 著者氏名（和/英）	/ Tomoko MATSUI
第 1 著者所属（和/英）	The Institute of Statistical Mathematics
発表年月日	2004/12/13
資料番号	NLC2004-54,SP2004-94
巻番号（vol）	vol.104
号番号（no）	538
ページ範囲	pp.-
ページ数	6
発行日