Presentation 2002/7/8
Difference Evaluation between Documents Using Topic Difference Factor Analysis
Takahiko KAWATANI
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper proposes a method for extracting the distinctive parts of a given document, that is, the parts that represent its major differences from another document with which it is compared. The method must satisfy three conditions: (1) the extracted parts not only represent the differences between the two documents but are also important within the given document; (2) the method can explain how each extracted part is distinctive; and (3) the method yields a distinctiveness score for each sentence or word. To satisfy these conditions, the paper adopts Topic Difference Factor Analysis (TDFA), previously proposed by the author. Experiments that extracted distinctive sentences from a given document compared with a very similar document confirmed the validity of the method.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Topic Difference Factor / TDFA / Document Difference / Distinctiveness
Paper # NLC2002-16
Date of Issue

Conference Information
Committee NLC
Conference Date 2002/7/8 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Difference Evaluation between Documents Using Topic Difference Factor Analysis
Sub Title (in English)
Keyword(1) Topic Difference Factor
Keyword(2) TDFA
Keyword(3) Document Difference
Keyword(4) Distinctiveness
1st Author's Name Takahiko KAWATANI
1st Author's Affiliation Hewlett-Packard Labs Japan, Hewlett-Packard Japan
Date 2002/7/8
Paper # NLC2002-16
Volume (vol) vol.102
Number (no) 199
Page pp.-
#Pages 6
Date of Issue