Presentation 2020-09-02
A method for clarifying differences between feature distributions of various solutions about topic model
Toshio Uchiyama, Tsukasa Hokimoto,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Probabilistic Latent Semantic Analysis and Latent Dirichlet analysis are known as topic models to analyze text data and images. When parameters (= solution) of the topic model are obtained by the optimization algorithm, various solutions are reached due to differences in algorithms, initial values. In a related study, we showed how to visualize the distribution of various solutions. However, since information on specific characteristics is not represented, it is unclear what and how they differ, and thus difficulties remain in using the results. Therefore, in order to select the appropriate solution for the application, we propose a method for identifying the similarities and differences between the solutions by focusing on the feature distribution of the solutions. We show that this makes it possible to discover both typical and unexpected solutions, taking into account the characteristics.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) topic model / diversity of solutions / feature distribution / Jaccard coefficient / similarity between solutions
Paper # PRMU2020-8
Date of Issue 2020-08-26 (PRMU)

Conference Information
Committee PRMU
Conference Date 2020/9/2(1days)
Place (in Japanese) (See Japanese page)
Place (in English) Online
Topics (in Japanese) (See Japanese page)
Topics (in English) Multi-modal, Cross-modal
Chair Yoichi Sato(Univ. of Tokyo)
Vice Chair Akisato Kimura(NTT) / Masakazu Iwamura(Osaka Pref. Univ.)
Secretary Akisato Kimura(Mobility Technologies) / Masakazu Iwamura(Chubu Univ.)
Assistant Takashi Shibata(NTT) / Masashi Nishiyama(Tottori Univ.)

Paper Information
Registration To Technical Committee on Pattern Recognition and Media Understanding
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A method for clarifying differences between feature distributions of various solutions about topic model
Sub Title (in English)
Keyword(1) topic model
Keyword(2) diversity of solutions
Keyword(3) feature distribution
Keyword(4) Jaccard coefficient
Keyword(5) similarity between solutions
1st Author's Name Toshio Uchiyama
1st Author's Affiliation Hokkaido Information University(HIU)
2nd Author's Name Tsukasa Hokimoto
2nd Author's Affiliation Hokkaido Information University(HIU)
Date 2020-09-02
Paper # PRMU2020-8
Volume (vol) vol.120
Number (no) PRMU-154
Page pp.pp.1-6(PRMU),
#Pages 6
Date of Issue 2020-08-26 (PRMU)