BICに基づく話者モデル選択の教師なし話者インデキシングにおける評価(福祉と音声処理及び一般)

Presentation	2003/10/24 Evaluation of Speaker Model Selection based on Bayesian Information Criterion in Unsupervised Speaker Indexing Masafumi NISHIDA, Tatsuya KAWAHARA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper addresses unsupervised speaker indexing for discussion audio archives. We have performed the speaker indexing using our proposed framework that selects an optimal speaker model (GMM or VQ) based on the BIC. A threshold of the speaker indexing is needed to be determined in advance because the framework is applied to the speaker indexing in the case where the number of speakers is unknown beforehand. Thus, we evaluate robustness of indexing accuracy when varying the threshold and the indexing accuracy when the number of speakers instead of the threshold is given. As a result of comparison with conventional methods, it is demonstrated that the proposed framework can set up the threshold robustly and archives the higher indexing accuracy in both cases where the number of speakers is unknown or given beforehand. The speaker index is useful for speaker adaptation of the acoustic model, which improves the performance of automatic speech recognition.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Speaker model selection / Bayesian information criterion / Unsupervised speaker indexing / Speaker recognition / Speech recognition / Discussions
Paper #	SP2003-103,WIT2003-15
Date of Issue

Paper Information
Registration To	Well-being Information Technology(WIT)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Evaluation of Speaker Model Selection based on Bayesian Information Criterion in Unsupervised Speaker Indexing
Sub Title (in English)
Keyword(1)	Speaker model selection
Keyword(2)	Bayesian information criterion
Keyword(3)	Unsupervised speaker indexing
Keyword(4)	Speaker recognition
Keyword(5)	Speech recognition
Keyword(6)	Discussions
1st Author's Name	Masafumi NISHIDA
1st Author's Affiliation	Graduate School of Science ant Technology, Chiba University()
2nd Author's Name	Tatsuya KAWAHARA
2nd Author's Affiliation	School of Informatics, Kyoto University:PRESTO, Japan Science and Technology Corporation (JST)
Date	2003/10/24
Paper #	SP2003-103,WIT2003-15
Volume (vol)	vol.103
Number (no)	402
Page	pp.pp.-
#Pages	6
Date of Issue