Presentation | 1994/10/21 Document Classification Using Important Kanji Characters Extracted by X^2 Method Yasuhiko Watanabe, Masahito Takeuchi, Masaki Murata, Makoto Nagao, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | It is generally recognized to classify a given document into several categories by using technical words which preferably appear in one category than the other.However,we have much difficulties to extract the technical words properly from Japanese sentences.Instead of these technical words,we adopted kanji characters which preferably appear in one category than the other. In this paper,we describe how to extract the important kanji characters for document classification by X^2 method and how to classfy documents in a simple pattern classification method.Then, we examined our method and the correct recognition scores for"TENS EI JINGO",editorial articles,and articles in"SCIENCE"were 41.6%,77 .9%,and 92.7%,respectively. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | dooument classification / important kanji character / X^2 method / Nippon Decimal Classification / encyclopedia |
Paper # | NLC94-25 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 1994/10/21(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Document Classification Using Important Kanji Characters Extracted by X^2 Method |
Sub Title (in English) | |
Keyword(1) | dooument classification |
Keyword(2) | important kanji character |
Keyword(3) | X^2 method |
Keyword(4) | Nippon Decimal Classification |
Keyword(5) | encyclopedia |
1st Author's Name | Yasuhiko Watanabe |
1st Author's Affiliation | Department Electric Engineering II Faculty of Engineering,Kyoto University() |
2nd Author's Name | Masahito Takeuchi |
2nd Author's Affiliation | Department Electric Engineering II Faculty of Engineering,Kyoto University |
3rd Author's Name | Masaki Murata |
3rd Author's Affiliation | Department Electric Engineering II Faculty of Engineering,Kyoto University / |
4th Author's Name | Makoto Nagao |
4th Author's Affiliation | |
Date | 1994/10/21 |
Paper # | NLC94-25 |
Volume (vol) | vol.94 |
Number (no) | 292 |
Page | pp.pp.- |
#Pages | 8 |
Date of Issue |