Presentation 2001/10/10
The Documents Classification and Retrieval by Removing of Words' Noise
Noriaki Kawamae, Terumasa Aoki, Hiroshi Yasuda
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper presents a novel approach to mapping documents into a conceptual space. Many search systems are based not on concepts but on simple word matching. Finding information with this method is difficult, because it is hard to express concepts in words and because word usage differs from person to person. We define these differences as words' noise. Our information retrieval method uses not the words themselves but the concepts that generate the words in documents. We remove the words' noise, infer the concepts from the words, and map documents into the concept space. The relation between documents is measured not by the words in the documents but by concepts. A measure based on concepts approximates the essential similarity between documents' contents. Therefore, the precision of document classification improves, and users can search by their concepts.
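The idea in the abstract, comparing documents by inferred concepts rather than by literal words, can be illustrated with a small sketch. This is not the authors' method: the record lists Factor Analysis among its keywords, whereas the code below swaps in a plain latent-semantic-style projection (leading eigenvectors of the term-term matrix, found by power iteration with deflation). The toy corpus, the choice of two concepts, and all function names are assumptions made for illustration only.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cosine(u, v):
    # Cosine similarity; defined as 0.0 when either vector is all zeros.
    nu, nv = math.sqrt(dot(u, u)), math.sqrt(dot(v, v))
    return dot(u, v) / (nu * nv) if nu and nv else 0.0

def bag_of_words(text, vocab):
    # Raw term counts over a fixed vocabulary.
    words = text.split()
    return [float(words.count(term)) for term in vocab]

def top_concepts(doc_vectors, k, iterations=200):
    # Leading k eigenvectors of the term-term matrix A^T A,
    # computed by power iteration with deflation.
    n = len(doc_vectors[0])
    gram = [[sum(d[i] * d[j] for d in doc_vectors) for j in range(n)]
            for i in range(n)]
    concepts = []
    for _concept in range(k):
        v = [1.0 / math.sqrt(n)] * n
        for _step in range(iterations):
            w = [dot(row, v) for row in gram]
            norm = math.sqrt(dot(w, w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        eigenvalue = dot(v, [dot(row, v) for row in gram])
        concepts.append(v)
        # Deflate: remove the component just found before the next pass.
        gram = [[gram[i][j] - eigenvalue * v[i] * v[j] for j in range(n)]
                for i in range(n)]
    return concepts

def to_concept_space(vector, concepts):
    # Coordinates of a term-count vector along each concept direction.
    return [dot(vector, c) for c in concepts]

# Hypothetical toy corpus: "car" and "automobile" never co-occur,
# but both appear alongside "engine".
corpus = ["car engine", "automobile engine", "flower petal"]
vocab = sorted({w for doc in corpus for w in doc.split()})
vectors = [bag_of_words(doc, vocab) for doc in corpus]
concepts = top_concepts(vectors, k=2)

car = bag_of_words("car", vocab)
automobile = bag_of_words("automobile", vocab)
flower = bag_of_words("flower", vocab)

word_sim = cosine(car, automobile)
concept_sim = cosine(to_concept_space(car, concepts),
                     to_concept_space(automobile, concepts))
unrelated_sim = cosine(to_concept_space(car, concepts),
                       to_concept_space(flower, concepts))

print("word-level similarity (car, automobile):", word_sim)
print("concept-level similarity (car, automobile):", round(concept_sim, 3))
print("concept-level similarity (car, flower):", round(unrelated_sim, 3))
```

In this toy setting the two synonyms share no words, so their word-level similarity is zero, yet they land on the same direction in the reduced space because they occur in the same context. Dropping the trailing eigenvectors plays the role of discarding word-usage variation here; how the paper itself defines and removes words' noise is not specified in this record.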
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Information Retrieval / Conceptual Search / Document Classification / Factor Analysis / Latent Semantic Space
Paper # NLC 2001-48
Date of Issue

Conference Information
Committee NLC
Conference Date 2001/10/10 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) The Documents Classification and Retrieval by Removing of Words' Noise
Sub Title (in English)
Keyword(1) Information Retrieval
Keyword(2) Conceptual Search
Keyword(3) Document Classification
Keyword(4) Factor Analysis
Keyword(5) Latent Semantic Space
1st Author's Name Noriaki Kawamae
1st Author's Affiliation Research Center for Advanced Research and Technology, The University of Tokyo
2nd Author's Name Terumasa Aoki
2nd Author's Affiliation Research Center for Advanced Research and Technology, The University of Tokyo
3rd Author's Name Hiroshi Yasuda
3rd Author's Affiliation Research Center for Advanced Research and Technology, The University of Tokyo
Date 2001/10/10
Paper # NLC 2001-48
Volume (vol) vol.101
Number (no) 351
Page pp.-
#Pages 8
Date of Issue