Extraction of Important Words in the Document Set

Presentation	1998/5/13 Extraction of Important Words in the Document Set Atsunobu Koizumi, Takashi Okuda, Syuich Itoh
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper describes a method for extracting important words from a document set using word frequency. We propose a notion of an important word as one that represents both the inter-document characteristics within the given document set and the content of the documents. The Kullback-Leibler distance between p(d|w), the probability of a document conditioned on a word, and p(d) is calculated for each word, and the words are ranked by this quantity. Experimental results are shown and discussed.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	important word / auto extraction / document analysis / divergence
Paper #
Date of Issue
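
The ranking described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how p(d) and p(d|w) are estimated or in which direction the divergence is taken. The sketch assumes p(d) is each document's share of all tokens, p(d|w) is the share of a word's occurrences that fall in document d, and the score is KL(p(d|w) || p(d)). The function name rank_words_by_kl and the tokenized-list input format are hypothetical.

```python
import math
from collections import Counter


def rank_words_by_kl(docs):
    """Rank words by KL(p(d|w) || p(d)) over a list of tokenized documents.

    Assumptions (not stated in the abstract): p(d) is the fraction of all
    tokens that belong to document d, and p(d|w) is the fraction of the
    occurrences of word w that fall in document d.
    """
    counts = [Counter(doc) for doc in docs]
    doc_lengths = [sum(c.values()) for c in counts]
    total = sum(doc_lengths)
    if total == 0:
        return []  # no tokens at all, nothing to rank

    p_d = [n / total for n in doc_lengths]

    # Total occurrences of each word across the whole document set.
    word_totals = Counter()
    for c in counts:
        word_totals.update(c)

    scores = {}
    for word, n_w in word_totals.items():
        kl = 0.0
        for c, pd in zip(counts, p_d):
            n_wd = c.get(word, 0)
            if n_wd == 0:
                continue  # a zero-probability term contributes nothing
            p_dw = n_wd / n_w
            kl += p_dw * math.log(p_dw / pd)
        scores[word] = kl

    # Highest divergence first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    sample = [["a", "b", "b"], ["a", "c", "c"]]
    for word, score in rank_words_by_kl(sample):
        print(word, round(score, 4))
```

Under these assumptions, a word whose occurrences are spread across documents in proportion to document length scores zero, while a word concentrated in a few documents scores higher.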

Paper Information
Registration To	Data Engineering (DE)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Extraction of Important Words in the Document Set
Sub Title (in English)
Keyword(1)	important word
Keyword(2)	auto extraction
Keyword(3)	document analysis
Keyword(4)	divergence
1st Author's Name	Atsunobu Koizumi
1st Author's Affiliation	Graduate School of Information Systems, Univ.Electro-Communications
2nd Author's Name	Takashi Okuda
2nd Author's Affiliation	Fujitsu
3rd Author's Name	Syuich Itoh
3rd Author's Affiliation	Graduate School of Information Systems, Univ.Electro-Communications
Date	1998/5/13
Paper #
Volume (vol)	vol.98
Number (no)	42
Page	pp.-
#Pages	6
Date of Issue