Presentation 2010-06-28
Two step adjustment technique of term weight
Hiroya YANO, Tai NAKAJIMA, Hayato YAMANA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) TF・IDF method is one of the methods to weight terms in the field of document retrieval. IDF value shows the degree of how a term is difficult to appear in the document set, and depends on the document set to be retrieved. Therefore, the problem is that, even if a term is difficult to appear in the same field of document set as query(which means the term is highly specific in the document), IDF value of term which appears easily in the document set to be retrieved is small. In this paper, we propose and study two step adjustment technique of term weight. In the first step, we get documents related to query using vector space model. In the next step, we retrieve relevant documents using IDF calculated from the document set acquired in the first step. Experiments using NTCIR-1 IR task collection indicate that, the precision of proposed method is improved about 7.1 percent comparing to that vector space model, and is almost the same value of the precision which get the highest in NTCIR-1.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) document retrieval / TF・IDF / pseudo-relevance feedback
Paper # DE2010-9
Date of Issue

Conference Information
Committee DE
Conference Date 2010/6/21(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Two step adjustment technique of term weight
Sub Title (in English)
Keyword(1) document retrieval
Keyword(2) TF・IDF
Keyword(3) pseudo-relevance feedback
1st Author's Name Hiroya YANO
1st Author's Affiliation Graduate School of Fundamental Science and Engineering, Waseda University()
2nd Author's Name Tai NAKAJIMA
2nd Author's Affiliation Graduate School of Fundamental Science and Engineering, Waseda University
3rd Author's Name Hayato YAMANA
3rd Author's Affiliation Faculty of Science and Engineering, Waseda University:National Institute of Informatics
Date 2010-06-28
Paper # DE2010-9
Volume (vol) vol.110
Number (no) 107
Page pp.pp.-
#Pages 6
Date of Issue