Presentation | 1998/5/13 Extraction of Important Words in the Document Set Atsunobu Koizumi, Takashi Okuda, Syuich Itoh, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper describes a method to extract important word from a document set using word frequency. We propose the notion of important word which represenets the inter-document characteristics within the given document set and the content of documents. The Kullback-Leibler distances between p(d|w), which is probability of a document conditioned by a word, and p(d) are calculated and the words are ranked by this quantity. Experimental results are shown and discussed. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | important word / auto extraction / document analysis / divergence |
Paper # | |
Date of Issue |
Conference Information | |
Committee | DE |
---|---|
Conference Date | 1998/5/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Data Engineering (DE) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Extraction of Important Words in the Document Set |
Sub Title (in English) | |
Keyword(1) | important word |
Keyword(2) | auto extraction |
Keyword(3) | document analysis |
Keyword(4) | divergence |
1st Author's Name | Atsunobu Koizumi |
1st Author's Affiliation | Graduate School of Information Systems, Univ.Electro-Communications() |
2nd Author's Name | Takashi Okuda |
2nd Author's Affiliation | Fujitsu |
3rd Author's Name | Syuich Itoh |
3rd Author's Affiliation | Graduate School of Information Systems, Univ.Electro-Communications |
Date | 1998/5/13 |
Paper # | |
Volume (vol) | vol.98 |
Number (no) | 42 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |