Presentation | 2008-02-07 Extracting Opinions for Broadcasting using Multiple Sequence Alignment Takeshi S. KOBAYAKAWA, Naoto KATO, Jun'ichi TSUJII |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This study addresses the computer-based classification of electronic opinions for broadcasting. A characteristic difficulty in analyzing opinions for broadcasting is that what the opinions are about cannot be enumerated in advance with a limited set of words. It is therefore necessary to identify the part of a text where an opinion is expressed without first identifying what the opinion is about. To identify that part, this study uses global information rather than local n-gram information. Frequently used opinion expressions are extracted by aligning many morphologically analyzed opinions. Aligning more than two sequences is called multiple sequence alignment, and it can be applied to finding frequently used expressions. Multiple sequence alignment is tractable only with approximate methods; we propose an approximate method that preserves the alignment ambiguity that is likely to arise in natural language processing. Because the proposed method is essentially exponential in computational cost, several cutoffs are introduced to keep the computation feasible. Since multiple final candidates remain, external criteria are introduced to rank them. The proposed method is applied to a corpus of opinions for broadcasting collected in the experiment, and its accuracy in extracting expressions is compared with that of the conventional method. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Multiple Sequence Alignment / Dynamic Programming / Document Classification / Ambiguity of Alignment |
Paper # | NLC2007-91 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2008/1/31 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Extracting Opinions for Broadcasting using Multiple Sequence Alignment |
Sub Title (in English) | |
Keyword(1) | Multiple Sequence Alignment |
Keyword(2) | Dynamic Programming |
Keyword(3) | Document Classification |
Keyword(4) | Ambiguity of Alignment |
1st Author's Name | Takeshi S. KOBAYAKAWA |
1st Author's Affiliation | Japan Broadcasting Corporation:Graduate School of Information Science and Technology, The University of Tokyo |
2nd Author's Name | Naoto KATO |
2nd Author's Affiliation | Japan Broadcasting Corporation |
3rd Author's Name | Jun'ichi TSUJII |
3rd Author's Affiliation | Graduate School of Information Science and Technology, The University of Tokyo |
Date | 2008-02-07 |
Paper # | NLC2007-91 |
Volume (vol) | vol.107 |
Number (no) | 480 |
Page | pp.- |
#Pages | 6 |
Date of Issue |