Presentation 1994/10/21
Document Classification Using Important Kanji Characters Extracted by X^2 Method
Yasuhiko Watanabe, Masahito Takeuchi, Masaki Murata, Makoto Nagao,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) It is generally recognized to classify a given document into several categories by using technical words which preferably appear in one category than the other.However,we have much difficulties to extract the technical words properly from Japanese sentences.Instead of these technical words,we adopted kanji characters which preferably appear in one category than the other. In this paper,we describe how to extract the important kanji characters for document classification by X^2 method and how to classfy documents in a simple pattern classification method.Then, we examined our method and the correct recognition scores for"TENS EI JINGO",editorial articles,and articles in"SCIENCE"were 41.6%,77 .9%,and 92.7%,respectively.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) dooument classification / important kanji character / X^2 method / Nippon Decimal Classification / encyclopedia
Paper # NLC94-25
Date of Issue

Conference Information
Committee NLC
Conference Date 1994/10/21(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Document Classification Using Important Kanji Characters Extracted by X^2 Method
Sub Title (in English)
Keyword(1) dooument classification
Keyword(2) important kanji character
Keyword(3) X^2 method
Keyword(4) Nippon Decimal Classification
Keyword(5) encyclopedia
1st Author's Name Yasuhiko Watanabe
1st Author's Affiliation Department Electric Engineering II Faculty of Engineering,Kyoto University()
2nd Author's Name Masahito Takeuchi
2nd Author's Affiliation Department Electric Engineering II Faculty of Engineering,Kyoto University
3rd Author's Name Masaki Murata
3rd Author's Affiliation Department Electric Engineering II Faculty of Engineering,Kyoto University /
4th Author's Name Makoto Nagao
4th Author's Affiliation
Date 1994/10/21
Paper # NLC94-25
Volume (vol) vol.94
Number (no) 292
Page pp.pp.-
#Pages 8
Date of Issue