Presentation 2006-07-13
Name Disambiguation in Web Search Using Knowledge Base
VU Quang MINH, Tomonari MASADA, Atsuhiro TAKASU, Jun ADACHI,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Results of queries by personal names often contain documents related to several people because of namesake problem. In order to discriminate documents related to different people, it is required an effective method to measure document similarities and to find out relevant documents of the same person. Some previous researches have used cosine similarity method or have tried to extract common named entities for measuring similarities. We propose a new method which uses web directories as knowledge base to find out shared contexts in document pairs and uses the measurement of shared contexts as similarities between document pairs. Experimental results show that our proposed method outperforms cosine similarity method and common named entities method.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Personal name searching / name disambiguation / document similarity
Paper # DE2006-74
Date of Issue

Conference Information
Committee DE
Conference Date 2006/7/6(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Name Disambiguation in Web Search Using Knowledge Base
Sub Title (in English)
Keyword(1) Personal name searching
Keyword(2) name disambiguation
Keyword(3) document similarity
1st Author's Name VU Quang MINH
1st Author's Affiliation Graduate School of Information Science and Technology, The University of Tokyo()
2nd Author's Name Tomonari MASADA
2nd Author's Affiliation National Institute of Informatics
3rd Author's Name Atsuhiro TAKASU
3rd Author's Affiliation National Institute of Informatics
4th Author's Name Jun ADACHI
4th Author's Affiliation National Institute of Informatics
Date 2006-07-13
Paper # DE2006-74
Volume (vol) vol.106
Number (no) 149
Page pp.pp.-
#Pages 6
Date of Issue