Presentation | 2001/10/10 The Documents Classification andRetrieval by Removing of Words' Noise Noriaki Kawamae, Terumasa Aoki, Hiroshi Yasuda, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper presents a novel approach mapping documents into a conceptual space. Many search systems are based on not concepts but simple words matching method. We have trouble in seeking an information by this method. Because it is hard for us to exchange concepts into words and words' usage differs by people. We define these difference words' noise. Our presented information retrieval method use not words but concepts generating words in documents. We remove the words' noise, infer the concepts from wrods and map documents in the concept space. The relation of documents is measured not words in documents but concepts. The measure based on the concepts approximates the esseptial similarity between documents' contents. Therefore the precision of documents classification improves, and users can search by their concepts. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Information Retrieval / Conceptual Search / Document Classification / Factor Analysis / Latent Semantic Space |
Paper # | NLC 2001-48 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2001/10/10(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | The Documents Classification andRetrieval by Removing of Words' Noise |
Sub Title (in English) | |
Keyword(1) | Information Retrieval |
Keyword(2) | Conceptual Search |
Keyword(3) | Document Classification |
Keyword(4) | Factor Analysis |
Keyword(5) | Latent Semantic Space |
1st Author's Name | Noriaki Kawamae |
1st Author's Affiliation | Research Center for Advanced Research and Technology, The University of Tokyo() |
2nd Author's Name | Terumasa Aoki |
2nd Author's Affiliation | Research Center for Advanced Research and Technology, The University of Tokyo |
3rd Author's Name | Hiroshi Yasuda |
3rd Author's Affiliation | Research Center for Advanced Research and Technology, The University of Tokyo |
Date | 2001/10/10 |
Paper # | NLC 2001-48 |
Volume (vol) | vol.101 |
Number (no) | 351 |
Page | pp.pp.- |
#Pages | 8 |
Date of Issue |