Presentation 2006/12/15
Topic and style adaptation using vocabulary divided PLSA language model by criterion of information
Naoto KURIYAMA, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO
Abstract(in Japanese) (See Japanese page)
Abstract(in English) PLSA (Probabilistic Latent Semantic Analysis) is one of the promising methods for language model adaptation. We propose a new way to combine PLSA and N-gram models by separating the vocabulary into three classes: 'topic'-related, 'style'-related, and 'general' words. The method trains a topic-vocabulary PLSA model, a style-vocabulary PLSA model, and a general-vocabulary unigram model independently, and then combines the three models. We also propose a method for automatically constructing the vocabulary division criterion from the pattern of word-class occurrence in newspaper text and in CSJ. Experimental results showed that the proposed method achieves a 15.48% perplexity reduction relative to the conventional PLSA model on a test set whose combination of topic and style features does not occur together in the training data.
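The abstract does not state how the three models are combined or how the perplexity reduction is computed, so the following is only a minimal sketch under stated assumptions. It assumes each word belongs to exactly one vocabulary class and that the combined probability is a class weight multiplied by the word probability under that class's model. The standard definition of perplexity and a relative reduction in percent are used. All names (`word_class`, `class_weight`, `class_models`, `combined_prob`) and all numbers are hypothetical placeholders; in the paper the topic and style distributions would come from trained PLSA models rather than fixed tables.

```python
import math

# Hypothetical vocabulary division: every word belongs to exactly one class.
word_class = {
    "election": "topic", "budget": "topic",
    "um": "style", "well": "style",
    "the": "general", "of": "general",
}

# Assumed class weights (sum to 1); not taken from the paper.
class_weight = {"topic": 0.3, "style": 0.2, "general": 0.5}

# Placeholder per-class word distributions (each sums to 1 over its class).
# Stand-ins for the topic PLSA, style PLSA, and general unigram models.
class_models = {
    "topic": {"election": 0.6, "budget": 0.4},
    "style": {"um": 0.7, "well": 0.3},
    "general": {"the": 0.5, "of": 0.5},
}

def combined_prob(word):
    """Assumed combination: P(w) = P(class(w)) * P(w | model of class(w))."""
    c = word_class[word]
    return class_weight[c] * class_models[c][word]

def perplexity(words, prob):
    """Standard perplexity: exp of the negative mean log-probability."""
    log_sum = sum(math.log(prob(w)) for w in words)
    return math.exp(-log_sum / len(words))

def relative_reduction(baseline_ppl, proposed_ppl):
    """Relative perplexity reduction in percent."""
    return 100.0 * (baseline_ppl - proposed_ppl) / baseline_ppl

if __name__ == "__main__":
    test_words = ["the", "election", "um", "of", "budget"]
    print(perplexity(test_words, combined_prob))
```

Under this reading, a figure such as 15.48% would correspond to `relative_reduction(baseline_ppl, proposed_ppl)`, with the conventional PLSA model as the baseline.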
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Language model / PLSA / Topic adaptation / Speaker adaptation
Paper # NLC2006-68,SP2006-124
Date of Issue

Conference Information
Committee NLC
Conference Date 2006/12/15 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Topic and style adaptation using vocabulary divided PLSA language model by criterion of information
Sub Title (in English)
Keyword(1) Language model
Keyword(2) PLSA
Keyword(3) Topic adaptation
Keyword(4) Speaker adaptation
1st Author's Name Naoto KURIYAMA
1st Author's Affiliation Graduate School of Engineering, Tohoku University
2nd Author's Name Motoyuki SUZUKI
2nd Author's Affiliation Graduate School of Engineering, Tohoku University
3rd Author's Name Akinori ITO
3rd Author's Affiliation Graduate School of Engineering, Tohoku University
4th Author's Name Shozo MAKINO
4th Author's Affiliation Graduate School of Engineering, Tohoku University
Date 2006/12/15
Paper # NLC2006-68,SP2006-124
Volume (vol) vol.106
Number (no) 442
Page pp.-
#Pages 6