Presentation 2023-03-15
Search by topic model to support research of ethnographic materials
Junya Komatsu, Tetsuya Morizumi, Hirotsugu Kinoshita,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In a text retrieval system, there are three types of retrieval methods: word retrieval, textual relationship retrieval, and ontology retrieval. However, none of these methods can search for the conceptual meaning of a text itself when it appears in the context, in other words, a search similar to the act of reading between the lines (we call this latent text search). In this paper, latent texts are considered as random variables. Latent random variables are probabilistic categories for latent texts, to which the topic model is applied. On the other hand, ontology and probability space are defined to have a corollary relationship with each other, and latent texts are represented by interpreting random variables from the ontology side. In this paper, we propose a model that relates entropic co-occurrences on the probability space side as an ontology. Furthermore, we show the validity of defining co-occurrence as the change in entropy between topics caused by changes in the number of topics in the topic model.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) HIMOJI / ontology / latent random variable / topic model
Paper # SITE2022-56,IA2022-79
Date of Issue 2023-03-08 (SITE, IA)

Conference Information
Committee IA / SITE / IPSJ-IOT
Conference Date 2023/3/15(3days)
Place (in Japanese) (See Japanese page)
Place (in English) Maebashi Institute of Technology
Topics (in Japanese) (See Japanese page)
Topics (in English) Internet and Information Ethics Education, etc.
Chair Tomoki Yoshihisa(Osaka Univ.) / Takushi Otani(Kibi International Univ.)
Vice Chair Yusuke Sakumoto(Kwansei Gakuin Univ.) / Yuichiro Hei(KDDI Research) / Hiroshi Yamamoto(Ritsumeikan Univ.) / Soichiro Morishita(Cyber Agent) / Takeo Tatsumi(Open Univ. of Japan)
Secretary Yusuke Sakumoto(Osaka Univ.) / Yuichiro Hei(Kogakuin Univ.) / Hiroshi Yamamoto(Kyushu Inst. of Tech.) / Soichiro Morishita(NRI-Secure) / Takeo Tatsumi(Hokuriku Univ.)
Assistant Daisuke Kotani(Kyoto Univ.) / Ryo Nakamura(Fukuoka Univ.) / Ryo Nakamura(Univ. of Tokyo) / Yusuke Tachibana(Fukuoka Inst. of Tech.)

Paper Information
Registration To Technical Committee on Internet Architecture / Technical Committee on Social Implications of Technology and Information Ethics / Special Interest Group on Internet and Operation Technology
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Search by topic model to support research of ethnographic materials
Sub Title (in English)
Keyword(1) HIMOJI
Keyword(2) ontology
Keyword(3) latent random variable
Keyword(4) topic model
1st Author's Name Junya Komatsu
1st Author's Affiliation Graduate School of Kanagawa University(Kanagawa Univ)
2nd Author's Name Tetsuya Morizumi
2nd Author's Affiliation Kanagawa University(Kanagawa Univ)
3rd Author's Name Hirotsugu Kinoshita
3rd Author's Affiliation Kanagawa University(Kanagawa Univ)
Date 2023-03-15
Paper # SITE2022-56,IA2022-79
Volume (vol) vol.122
Number (no) SITE-433,IA-434
Page pp.pp.15-20(SITE), pp.15-20(IA),
#Pages 6
Date of Issue 2023-03-08 (SITE, IA)