Presentation | 2006/12/15 Topic and style adaptation using vocabulary divided PLSA language model by criterion of information Naoto KURIYAMA, Motoyuki SUZUKI, Akinori ITO, Shozo MAKINO, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | PLSA (Probabilistic Latent Semantic Analysis) is one of promising language model adaptation methods. We propose a new way to combine PLSA and N-gram models by separating the vocabulary into three classes-'topic'-related, 'style'-related and 'general'-related words. This method trains topic vocabulary PLSA model, style vocabulary PLSA model, and general vocabulary unigram model independently, and combines the three models. And we propose an automatic composing method of vocabulary divide criterion, using pattern of word-Class occurrence between newspaper and CSJ. The experimental result showed that the proposed method achieves 15.48% perplexity reduction than conventional PLSA model, about testset of which topic and style feature are not happen together in the training data. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Language model / PLSA / Topic adaptation / Speaker adaptation |
Paper # | NLC2006-68,SP2006-124 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2006/12/15(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Topic and style adaptation using vocabulary divided PLSA language model by criterion of information |
Sub Title (in English) | |
Keyword(1) | Language model |
Keyword(2) | PLSA |
Keyword(3) | Topic adaptation |
Keyword(4) | Speaker adaptation |
1st Author's Name | Naoto KURIYAMA |
1st Author's Affiliation | Graduate School of Engineering, Tohoku University() |
2nd Author's Name | Motoyuki SUZUKI |
2nd Author's Affiliation | Graduate School of Engineering, Tohoku University |
3rd Author's Name | Akinori ITO |
3rd Author's Affiliation | Graduate School of Engineering, Tohoku University |
4th Author's Name | Shozo MAKINO |
4th Author's Affiliation | Graduate School of Engineering, Tohoku University |
Date | 2006/12/15 |
Paper # | NLC2006-68,SP2006-124 |
Volume (vol) | vol.106 |
Number (no) | 442 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |