Presentation 1998/5/13
Extraction of Important Words in the Document Set
Atsunobu Koizumi, Takashi Okuda, Syuich Itoh
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper describes a method for extracting important words from a document set using word frequency. We propose a notion of important word that represents both the inter-document characteristics within the given document set and the content of the documents. The Kullback-Leibler divergence between p(d|w), the probability of a document conditioned on a word, and p(d) is calculated for each word, and the words are ranked by this quantity. Experimental results are presented and discussed.
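The ranking described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how p(d|w) and p(d) are estimated, so the sketch assumes p(d|w) is a word's count in a document divided by its count in the whole set, and p(d) is the document's share of all tokens. The function name rank_words_by_kl and the toy documents are hypothetical.

```python
import math
from collections import Counter


def rank_words_by_kl(documents):
    """Rank words by D(p(d|w) || p(d)), largest first.

    `documents` is a list of token lists. Both estimators below are
    assumptions made for illustration, not taken from the paper.
    """
    counts = [Counter(doc) for doc in documents]
    total_tokens = sum(sum(c.values()) for c in counts)
    # Assumption: p(d) is the document's share of all tokens in the set.
    p_d = [sum(c.values()) / total_tokens for c in counts]

    vocabulary = set().union(*counts)
    scores = {}
    for word in vocabulary:
        word_total = sum(c[word] for c in counts)
        kl = 0.0
        for c, prior in zip(counts, p_d):
            if c[word] == 0:
                continue  # a term with p(d|w) = 0 contributes nothing
            # Assumption: p(d|w) is the relative frequency of the word in d.
            p_d_given_w = c[word] / word_total
            kl += p_d_given_w * math.log(p_d_given_w / prior)
        scores[word] = kl
    # Sort by descending score; break ties alphabetically for stable output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Toy document set (hypothetical data).
docs = [["the", "cat"], ["the", "dog"]]
ranked = rank_words_by_kl(docs)
for word, score in ranked:
    print(f"{word}\t{score:.4f}")
```

Under these assumptions, a word spread across documents in proportion to their length, such as "the" in the toy set, scores zero, while a word concentrated in a single document scores highest.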
Keyword(in Japanese) (See Japanese page)
Keyword(in English) important word / auto extraction / document analysis / divergence
Paper #
Date of Issue

Conference Information
Committee DE
Conference Date 1998/5/13 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Extraction of Important Words in the Document Set
Sub Title (in English)
Keyword(1) important word
Keyword(2) auto extraction
Keyword(3) document analysis
Keyword(4) divergence
1st Author's Name Atsunobu Koizumi
1st Author's Affiliation Graduate School of Information Systems, Univ.Electro-Communications
2nd Author's Name Takashi Okuda
2nd Author's Affiliation Fujitsu
3rd Author's Name Syuich Itoh
3rd Author's Affiliation Graduate School of Information Systems, Univ.Electro-Communications
Date 1998/5/13
Paper #
Volume (vol) vol.98
Number (no) 42
Page pp.-
#Pages 6
Date of Issue