Presentation 2004/2/5
Creating Search Space for Related Information Retrieval using Web Content Similarities
Kuangmin TAN, Aki KOBAYASHI, Katsunori YAMAOKA, Yoshinori SAKAI
Abstract(in Japanese) (See Japanese page)
Abstract(in English) For related information retrieval on the Web, which is structured by hyperlinks, restricting the search space to a specific topic can increase the precision of the search results. One of the easiest ways to do this is to limit the search space to Web contents within a fixed link distance from the Web page specified by the user. However, the link distance between Web contents is not proportional to their semantic distance, so a fixed link distance alone is insufficient to remove unnecessary search results. In this paper, we propose representing the strength of the relationship between two linked Web contents by the similarity of their contents, so that each similarity reflects the corresponding link distance. When searching, the search space is formed by tracing both forward and backward links from the Web page specified by the user and gathering the Web contents that are closely related according to these similarities. To verify the effectiveness of our method, we applied it to the actual Web link space. The experimental results show that the proposed method increases the precision of the search results.
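The search-space construction described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the similarity measure, how similarities are combined along a path, or the cutoff rule, so everything concrete here is an assumption: bag-of-words cosine similarity, a path score equal to the product of the similarities along the links, a fixed threshold, and the hypothetical names `cosine_similarity` and `build_search_space`. It is not the authors' implementation.

```python
import heapq
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity of bag-of-words term-frequency vectors (assumed measure)."""
    a, b = Counter(text_a.lower().split()), Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values()) * sum(v * v for v in b.values()))
    # Clamp to 1.0 so rounding error can never make a path score grow.
    return min(1.0, dot / norm) if norm else 0.0


def build_search_space(seed, pages, links, threshold=0.3):
    """Gather pages whose accumulated relatedness to `seed` is >= threshold.

    pages: dict mapping a page id to its text.
    links: iterable of (source, target) hyperlink pairs.
    Returns a dict mapping each gathered page id to its best path score.
    """
    # Links are traced in both directions (forward and backward).
    neighbors = {page: set() for page in pages}
    for src, dst in links:
        if src in pages and dst in pages:
            neighbors[src].add(dst)
            neighbors[dst].add(src)

    best = {seed: 1.0}
    heap = [(-1.0, seed)]  # max-heap via negated scores
    while heap:
        neg_score, page = heapq.heappop(heap)
        score = -neg_score
        if score < best.get(page, 0.0):
            continue  # stale heap entry
        for nxt in neighbors[page]:
            # Assumed combination rule: multiply similarities along the path.
            candidate = score * cosine_similarity(pages[page], pages[nxt])
            if candidate >= threshold and candidate > best.get(nxt, 0.0):
                best[nxt] = candidate
                heapq.heappush(heap, (-candidate, nxt))
    return best
```

Under these assumptions, a page reached through a weakly related link is dropped even if it is only one hop away, while a strongly related page several hops away can still be kept, which is the behavior a fixed link distance cannot provide.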
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Web Information Retrieval / Search Space / Similarity
Paper # IN2003-191
Date of Issue

Conference Information
Committee IN
Conference Date 2004/2/5 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Information Networks (IN)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Creating Search Space for Related Information Retrieval using Web Content Similarities
Sub Title (in English)
Keyword(1) Web Information Retrieval
Keyword(2) Search Space
Keyword(3) Similarity
1st Author's Name Kuangmin TAN
1st Author's Affiliation Dept. of Communications and Integrated Systems, Graduate School of Science and Engineering, Tokyo Institute of Technology
2nd Author's Name Aki KOBAYASHI
2nd Author's Affiliation Dept. of Communications and Integrated Systems, Graduate School of Science and Engineering, Tokyo Institute of Technology
3rd Author's Name Katsunori YAMAOKA
3rd Author's Affiliation Global Scientific Information and Computing Center Systems, Tokyo Institute of Technology
4th Author's Name Yoshinori SAKAI
4th Author's Affiliation Dept. of Communications and Integrated Systems, Graduate School of Science and Engineering, Tokyo Institute of Technology
Date 2004/2/5
Paper # IN2003-191
Volume (vol) vol.103
Number (no) 650
Page pp.-
#Pages 6
Date of Issue