Presentation 2001/7/9
The Word Clustering Based on Statistical Model
Noriaki Kawamae, Terumasa Aoki, Hiroshi Yasuda,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) The existing search systems are based on simple word matching method. Therefore the variety of natural language prevent user search activity. The thesaurus is one answer to this problem. We propose a novel statistical word clustering to construct the thesaurus automatically. Here, the concepts are extracted from documents and words in documents are clustering into the same concepts. We can construct the thesaurus that is specialized on a domain and in a function by the word clustering. The proposed method is applied to a set of conference documents to examine the effectiveness of the generated word clustering.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Information Retrieval / Thesaurus / Conceptual Search / Word Classification / Factor Analysis
Paper # NLC2001-16
Date of Issue

Conference Information
Committee NLC
Conference Date 2001/7/9(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) The Word Clustering Based on Statistical Model
Sub Title (in English)
Keyword(1) Information Retrieval
Keyword(2) Thesaurus
Keyword(3) Conceptual Search
Keyword(4) Word Classification
Keyword(5) Factor Analysis
1st Author's Name Noriaki Kawamae
1st Author's Affiliation Research Center for Advanced Research and Technology, The University of Tokyo()
2nd Author's Name Terumasa Aoki
2nd Author's Affiliation Research Center for Advanced Research and Technology, The University of Tokyo
3rd Author's Name Hiroshi Yasuda
3rd Author's Affiliation Research Center for Advanced Research and Technology, The University of Tokyo
Date 2001/7/9
Paper # NLC2001-16
Volume (vol) vol.101
Number (no) 189
Page pp.pp.-
#Pages 6
Date of Issue