Presentation | 1997/7/25 The Reliability of Human Coding and Effects on Automatic Abstracting Tadashi Nomoto, Yuji Matsumoto |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We discuss a particular approach to automatic abstracting, where an abstract is created by extracting important sentences from a text. A primary purpose of the paper is to demonstrate that the reliability of human-supplied annotations on corpora has crucial effects on how well an automatic abstracting system performs. The corpus is developed through human judgements on possible summary sentences in a text. The reliability of human judgements is evaluated by the kappa statistic, a reliability metric standardly used in the behavioral sciences. The C4.5 decision tree method (Quinlan, 1993) is used to build an extraction model. We demonstrate that data reliability correlates positively with the performance of automatic abstracting, and show results indicating that the reliability of human-provided data is crucial for improving that performance. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | NATURAL LANGUAGE PROCESSING / JAPANESE / AUTOMATIC ABSTRACTING / SUMMARIZATION / CORPUS |
Paper # | NLC97-18 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 1997/7/25 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | The Reliability of Human Coding and Effects on Automatic Abstracting |
Sub Title (in English) | |
Keyword(1) | NATURAL LANGUAGE PROCESSING |
Keyword(2) | JAPANESE |
Keyword(3) | AUTOMATIC ABSTRACTING |
Keyword(4) | SUMMARIZATION |
Keyword(5) | CORPUS |
1st Author's Name | Tadashi Nomoto |
1st Author's Affiliation | Advanced Research Laboratory, Hitachi Ltd. |
2nd Author's Name | Yuji Matsumoto |
2nd Author's Affiliation | Nara Institute of Science and Technology |
Date | 1997/7/25 |
Paper # | NLC97-18 |
Volume (vol) | vol.97 |
Number (no) | 200 |
Page | pp.- |
#Pages | 6 |
Date of Issue |
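
The abstract states that annotation reliability is measured with the kappa statistic, which corrects raw agreement for the agreement expected by chance. The record does not say which kappa variant the authors used, so the following is only a minimal sketch assuming two annotators and Cohen's kappa; the function name `cohen_kappa` and the sample labels are hypothetical illustrations, not data from the paper.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items.

    Assumption: two-rater Cohen's kappa; the paper may have used
    a different (e.g. multi-rater) variant.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")

    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: product of each annotator's marginal label rates,
    # summed over labels.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label; agreement is trivial.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical judgements: 1 = "summary sentence", 0 = "not a summary sentence".
annotator_1 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
annotator_2 = [1, 0, 0, 0, 1, 0, 1, 1, 0, 0]
kappa = cohen_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.3f}")
```

In this made-up example the annotators agree on 8 of 10 sentences, but because chance agreement is 0.52, kappa comes out noticeably lower than the raw agreement rate.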