Presentation | 2022-11-29 Link Prediction from Text Content by NLP Graph Embedding Tzu-Ying Yang, Hsuan Lei Shao, Chih-Chuan Fan, Wei-Hsin Wang |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper extends the project “The Knowledge Database/Graph of China-studies”. The main goal is to predict new research streams from known journal papers using graph embedding and link prediction. The challenge is that our dataset does not include citation relationships; we therefore extract relationship features directly from the content of the papers. We used keyword collaboration and k-means to reduce dimensionality, then word2vec and an MLP to classify whether any two nodes will be linked in the next round (year). We achieved over 90% accuracy in each round, which is better than the baseline method (a random forest with Adamic-Adar and Jaccard scores). We also provide a visualization of the graph in action. We contribute a pipeline workflow for raw bibliographic datasets that do not include citation relationships; the workflow can also be applied to social media or other text-only datasets. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | knowledge graph / link prediction / natural language processing / GNN learning / China-studies |
Paper # | NLC2022-9,SP2022-29 |
Date of Issue | 2022-11-22 (NLC, SP) |
Conference Information | |
Committee | NLC / IPSJ-NL / SP / IPSJ-SLP |
---|---|
Conference Date | 2022/11/29(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Mitsuo Yoshida(Univ. of Tsukuba) / Katsuhito Sudoh(Nara Institute of Science and Technology) / Tomoki Toda(Nagoya Univ.) |
Vice Chair | Hiroki Sakaji(Univ. of Tokyo) / Takeshi Kobayakawa(NHK) |
Secretary | Hiroki Sakaji(NTT) / Takeshi Kobayakawa(Hiroshima Univ. of Economics) / (Denso IT Laboratory, Inc.) / (Hokkai-Gakuen Univ.) / (Tokyo Univ. of Agriculture and Technology) |
Assistant | Kanjin Takahashi(Sansan) / Yasuhiro Ogawa(Nagoya Univ.) / / Ryo Aihara(Mitsubishi Electric) / Daisuke Saito(Univ. of Tokyo) |
Paper Information | |
Registration To | Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing |
---|---|
Language | ENG-JTITLE |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Link Prediction from Text Content by NLP Graph Embedding |
Sub Title (in English) | A Study on Chinese Journal Articles |
Keyword(1) | knowledge graph |
Keyword(2) | link prediction |
Keyword(3) | natural language processing |
Keyword(4) | GNN learning |
Keyword(5) | China-studies |
1st Author's Name | Tzu-Ying Yang |
1st Author's Affiliation | National Taiwan Normal University(NTNU) |
2nd Author's Name | Hsuan Lei Shao |
2nd Author's Affiliation | National Taiwan Normal University(NTNU) |
3rd Author's Name | Chih-Chuan Fan |
3rd Author's Affiliation | National Taiwan Normal University(NTNU) |
4th Author's Name | Wei-Hsin Wang |
4th Author's Affiliation | National Taiwan Normal University(NTNU) |
Date | 2022-11-29 |
Paper # | NLC2022-9,SP2022-29 |
Volume (vol) | vol.122 |
Number (no) | NLC-287,SP-288 |
Page | pp.1-4(NLC), pp.1-4(SP) |
#Pages | 4 |
Date of Issue | 2022-11-22 (NLC, SP) |
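
The abstract compares the proposed pipeline against a baseline built on "Adar" and Jaccard scores. As a rough illustration only, the sketch below computes these two neighborhood-based scores for a candidate node pair, assuming "Adar" refers to the Adamic-Adar index. The toy edge list, node names, and function names are hypothetical and are not taken from the paper; the random-forest classifier that would consume these scores is not shown.

```python
import math
from collections import defaultdict


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)}."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def jaccard(adj, u, v):
    """Jaccard score: |N(u) & N(v)| / |N(u) | N(v)|, or 0.0 if both are isolated."""
    nu, nv = adj.get(u, set()), adj.get(v, set())
    union = nu | nv
    return len(nu & nv) / len(union) if union else 0.0


def adamic_adar(adj, u, v):
    """Adamic-Adar score: sum of 1 / log(degree(w)) over common neighbors w."""
    common = adj.get(u, set()) & adj.get(v, set())
    # A common neighbor of two distinct nodes has degree >= 2, so log > 0;
    # the guard only protects against degenerate input such as u == v.
    return sum(1.0 / math.log(len(adj[w])) for w in common if len(adj[w]) > 1)


# Hypothetical toy graph: nodes stand in for papers or keywords.
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]
adj = build_adjacency(edges)

# Scores for the unlinked candidate pair ("a", "d"), which shares neighbor "c".
pair_features = (jaccard(adj, "a", "d"), adamic_adar(adj, "a", "d"))
print(pair_features)
```

In a baseline of this kind, one such feature pair would typically be computed for every candidate node pair in a given round and passed to a classifier, with the label indicating whether the link appears in the next round.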