Presentation 2002/7/9
A method for generating microscopic clusters of text-based documents
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Grouping documents with strong associations into small sizes of clusters could be an effective way to improve the performance and the visibility of text-based retrieval systems. In this paper, we investigate the feasibility of one of such microscopic clustering methods: the proposed method detects strings that are 'recycled' in a given document collection to identify documents dealing with the same topic and also originating from the same information source.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) recycled strings / mutual information / document clustering / suffix-tree
Paper # NLC2002-26
Date of Issue

Conference Information
Committee NLC
Conference Date 2002/7/9(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Vice Chair

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A method for generating microscopic clusters of text-based documents
Sub Title (in English)
Keyword(1) recycled strings
Keyword(2) mutual information
Keyword(3) document clustering
Keyword(4) suffix-tree
1st Author's Name Akiko AIZAWA
1st Author's Affiliation National Institute of Informatics()
Date 2002/7/9
Paper # NLC2002-26
Volume (vol) vol.102
Number (no) 200
Page pp.pp.-
#Pages 7
Date of Issue