Presentation | 2019-12-20 Analysis method and results of solution diversity about topic model Toshio Uchiyama, Tsukasa Hokimoto, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Probabilistic Latent Semantic Analysis and Latent Dirichlet analysis are known as topic models to analyze text data and images. When parameters (= solution) of the topic model are obtained by the optimization algorithm, various solutions are reached due to differences in algorithms, initial values. However, there are many similar solutions, and it is redundant to treat them as different findings. To find non-redundant solution sets, We propose a method to calculate normalized mutual information (NMI) as similarity (inner product) between solutions, apply multidimensional scale method to coordinate values to them, and enable analysis and visualization. Distribution situation of solutions was visualized by experiment using various text data. We also clarified the dependence of representative algorithms (maximum a posterior estimation, collapsed Gibbs sampling, collapsed variational Bayesian inference) and initial values, and confirmed the usefulness of the proposed method. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | topic model / diversity of solutions / Normalized Mutual Information / Multidimensional scaling |
Paper # | PRMU2019-54 |
Date of Issue | 2019-12-12 (PRMU) |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2019/12/19(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Yoichi Sato(Univ. of Tokyo) |
Vice Chair | Toru Tamaki(Hiroshima Univ.) / Akisato Kimura(NTT) |
Secretary | Toru Tamaki(NTT) / Akisato Kimura(OMRON SINICX) |
Assistant | Yusuke Uchida(DeNA) / Takayoshi Yamashita(Chubu Univ.) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Analysis method and results of solution diversity about topic model |
Sub Title (in English) | |
Keyword(1) | topic model |
Keyword(2) | diversity of solutions |
Keyword(3) | Normalized Mutual Information |
Keyword(4) | Multidimensional scaling |
1st Author's Name | Toshio Uchiyama |
1st Author's Affiliation | Hokkaido Information University(HIU) |
2nd Author's Name | Tsukasa Hokimoto |
2nd Author's Affiliation | Hokkaido Information University(HIU) |
Date | 2019-12-20 |
Paper # | PRMU2019-54 |
Volume (vol) | vol.119 |
Number (no) | PRMU-347 |
Page | pp.pp.49-54(PRMU), |
#Pages | 6 |
Date of Issue | 2019-12-12 (PRMU) |