Presentation | 2020-09-02 A method for clarifying differences between feature distributions of various solutions about topic model Toshio Uchiyama, Tsukasa Hokimoto, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Probabilistic Latent Semantic Analysis and Latent Dirichlet analysis are known as topic models to analyze text data and images. When parameters (= solution) of the topic model are obtained by the optimization algorithm, various solutions are reached due to differences in algorithms, initial values. In a related study, we showed how to visualize the distribution of various solutions. However, since information on specific characteristics is not represented, it is unclear what and how they differ, and thus difficulties remain in using the results. Therefore, in order to select the appropriate solution for the application, we propose a method for identifying the similarities and differences between the solutions by focusing on the feature distribution of the solutions. We show that this makes it possible to discover both typical and unexpected solutions, taking into account the characteristics. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | topic model / diversity of solutions / feature distribution / Jaccard coefficient / similarity between solutions |
Paper # | PRMU2020-8 |
Date of Issue | 2020-08-26 (PRMU) |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2020/9/2(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Multi-modal, Cross-modal |
Chair | Yoichi Sato(Univ. of Tokyo) |
Vice Chair | Akisato Kimura(NTT) / Masakazu Iwamura(Osaka Pref. Univ.) |
Secretary | Akisato Kimura(Mobility Technologies) / Masakazu Iwamura(Chubu Univ.) |
Assistant | Takashi Shibata(NTT) / Masashi Nishiyama(Tottori Univ.) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A method for clarifying differences between feature distributions of various solutions about topic model |
Sub Title (in English) | |
Keyword(1) | topic model |
Keyword(2) | diversity of solutions |
Keyword(3) | feature distribution |
Keyword(4) | Jaccard coefficient |
Keyword(5) | similarity between solutions |
1st Author's Name | Toshio Uchiyama |
1st Author's Affiliation | Hokkaido Information University(HIU) |
2nd Author's Name | Tsukasa Hokimoto |
2nd Author's Affiliation | Hokkaido Information University(HIU) |
Date | 2020-09-02 |
Paper # | PRMU2020-8 |
Volume (vol) | vol.120 |
Number (no) | PRMU-154 |
Page | pp.pp.1-6(PRMU), |
#Pages | 6 |
Date of Issue | 2020-08-26 (PRMU) |