Presentation 2003/8/22
Multi Document Summary Analysis with Faceted Genre
Yohei SEKI, Koji EGUCHI, Noriko KANDO,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We assume that summaries vary according to the reader's information needs/intentions for information retrieval. We set several summary types like fact-based summary, prospective summary, numeric-data focused summary, or advice (opinion-oriented) summary etc. In our research, we define the viewpoints by combination of topics and summary types. The purpose of this study is to build a multi-document summarizer to produce summaries according to viewpoints. As an exploratory stage of investigation, we examined effectiveness of source documents genre to produce different types of summaries. We estimated R^2 (coefficeint of determination) improvement rates for sentence extraction with multiple linear regression analysis. The criterion variables is the sentence weights in source documents. The weights are computed based on the similarity of the human-produced summaries and sentences in the source documents. The explatory variables are sentence weitghting parameters. If we add 4 genre features into the explatory variables, from 4 to 12 summaries have a significant improvement for R^2 by each. We revealed that the augumentative feature (the third genre feature) relates to the commentary summary type.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Multi-Document Summarization / Viewpoint Summary / Genre Classification / Faceted Classification / Multiple Linear Regression Analysis / Opinion Summarization
Paper # NLC2003-23
Date of Issue

Conference Information
Committee NLC
Conference Date 2003/8/22(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Multi Document Summary Analysis with Faceted Genre
Sub Title (in English)
Keyword(1) Multi-Document Summarization
Keyword(2) Viewpoint Summary
Keyword(3) Genre Classification
Keyword(4) Faceted Classification
Keyword(5) Multiple Linear Regression Analysis
Keyword(6) Opinion Summarization
1st Author's Name Yohei SEKI
1st Author's Affiliation Department of Informatics, The Graduate University for Advanced Studies()
2nd Author's Name Koji EGUCHI
2nd Author's Affiliation Department of Informatics, The Graduate University for Advanced Studies:National Institute of Informatics
3rd Author's Name Noriko KANDO
3rd Author's Affiliation Department of Informatics, The Graduate University for Advanced Studies:National Institute of Informatics
Date 2003/8/22
Paper # NLC2003-23
Volume (vol) vol.103
Number (no) 280
Page pp.pp.-
#Pages 6
Date of Issue