Presentation | 2023-03-15 Search by topic model to support research of ethnographic materials Junya Komatsu, Tetsuya Morizumi, Hirotsugu Kinoshita, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In a text retrieval system, there are three types of retrieval methods: word retrieval, textual relationship retrieval, and ontology retrieval. However, none of these methods can search for the conceptual meaning of a text itself when it appears in the context, in other words, a search similar to the act of reading between the lines (we call this latent text search). In this paper, latent texts are considered as random variables. Latent random variables are probabilistic categories for latent texts, to which the topic model is applied. On the other hand, ontology and probability space are defined to have a corollary relationship with each other, and latent texts are represented by interpreting random variables from the ontology side. In this paper, we propose a model that relates entropic co-occurrences on the probability space side as an ontology. Furthermore, we show the validity of defining co-occurrence as the change in entropy between topics caused by changes in the number of topics in the topic model. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | HIMOJI / ontology / latent random variable / topic model |
Paper # | SITE2022-56,IA2022-79 |
Date of Issue | 2023-03-08 (SITE, IA) |
Conference Information | |
Committee | IA / SITE / IPSJ-IOT |
---|---|
Conference Date | 2023/3/15(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Maebashi Institute of Technology |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Internet and Information Ethics Education, etc. |
Chair | Tomoki Yoshihisa(Osaka Univ.) / Takushi Otani(Kibi International Univ.) |
Vice Chair | Yusuke Sakumoto(Kwansei Gakuin Univ.) / Yuichiro Hei(KDDI Research) / Hiroshi Yamamoto(Ritsumeikan Univ.) / Soichiro Morishita(Cyber Agent) / Takeo Tatsumi(Open Univ. of Japan) |
Secretary | Yusuke Sakumoto(Osaka Univ.) / Yuichiro Hei(Kogakuin Univ.) / Hiroshi Yamamoto(Kyushu Inst. of Tech.) / Soichiro Morishita(NRI-Secure) / Takeo Tatsumi(Hokuriku Univ.) |
Assistant | Daisuke Kotani(Kyoto Univ.) / Ryo Nakamura(Fukuoka Univ.) / Ryo Nakamura(Univ. of Tokyo) / Yusuke Tachibana(Fukuoka Inst. of Tech.) |
Paper Information | |
Registration To | Technical Committee on Internet Architecture / Technical Committee on Social Implications of Technology and Information Ethics / Special Interest Group on Internet and Operation Technology |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Search by topic model to support research of ethnographic materials |
Sub Title (in English) | |
Keyword(1) | HIMOJI |
Keyword(2) | ontology |
Keyword(3) | latent random variable |
Keyword(4) | topic model |
1st Author's Name | Junya Komatsu |
1st Author's Affiliation | Graduate School of Kanagawa University(Kanagawa Univ) |
2nd Author's Name | Tetsuya Morizumi |
2nd Author's Affiliation | Kanagawa University(Kanagawa Univ) |
3rd Author's Name | Hirotsugu Kinoshita |
3rd Author's Affiliation | Kanagawa University(Kanagawa Univ) |
Date | 2023-03-15 |
Paper # | SITE2022-56,IA2022-79 |
Volume (vol) | vol.122 |
Number (no) | SITE-433,IA-434 |
Page | pp.pp.15-20(SITE), pp.15-20(IA), |
#Pages | 6 |
Date of Issue | 2023-03-08 (SITE, IA) |