Presentation | 2005-07-13 Clustering News Articles using Named Entities Hiroyuki TODA, Ryoji KATAOKA, Hiroyuki KITAGAWA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Due to the growth of the Internet, the amount of information accessible to the public has almost exploded. Especially, news articles are intensively used for latest news watching, retrieving interesting information from news archives and so on. In news archive services, there is a demand to group news articles describing the same event. To address this problem, we use Named Entities in news articles to tell which events the articles describe. In this paper, we present the results of experiments to measure the appearance tendency of named entities in news articles and accuracy of clustering taking named entities into consideration, and discuss validity of the proposed approach. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Information Retrieval / Document Clustering / Named Entities / News Articles |
Paper # | DE2005-53 |
Date of Issue |
Conference Information | |
Committee | DE |
---|---|
Conference Date | 2005/7/6(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Data Engineering (DE) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Clustering News Articles using Named Entities |
Sub Title (in English) | |
Keyword(1) | Information Retrieval |
Keyword(2) | Document Clustering |
Keyword(3) | Named Entities |
Keyword(4) | News Articles |
1st Author's Name | Hiroyuki TODA |
1st Author's Affiliation | NTT Cyber Solutions Laboratories, NTT Corporation:Graduate School of Systems and Information Engineering, University of Tsukuba() |
2nd Author's Name | Ryoji KATAOKA |
2nd Author's Affiliation | NTT Cyber Solutions Laboratories, NTT Corporation |
3rd Author's Name | Hiroyuki KITAGAWA |
3rd Author's Affiliation | Graduate School of Systems and Information Engineering, University of Tsukuba:Center for Computational Sciences, University of Tsukuba |
Date | 2005-07-13 |
Paper # | DE2005-53 |
Volume (vol) | vol.105 |
Number (no) | 171 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |