Paper Abstract and Keywords |
Presentation |
2008-02-07 15:00
Extracting Opinions for Broadcasting using Multiple Sequence Alignment Takeshi S. Kobayakawa, Naoto Kato (NHK), Jun'ichi Tsujii (Tokyo Univ.) NLC2007-91 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
We study the computer-based classification of electronic opinions for broadcasting. A characteristic of such opinions is that it is hard to enumerate beforehand, with a limited vocabulary, what an opinion is about. It is therefore necessary to identify the part where an opinion is expressed without first identifying the part the opinion is about. To identify where an opinion is expressed, this study uses global information rather than local n-gram information. Frequently used opinion expressions are extracted by aligning many morphologically analyzed opinions. Aligning more than two sequences is called multiple sequence alignment, and it can be applied to extracting frequently used expressions. Multiple sequence alignment is tractable only with an approximate method, and we propose an approximate method that preserves the ambiguity that is likely to arise in natural language processing. The proposed method is essentially of exponential computational order, so several cutoffs are introduced to keep the computation feasible. Because multiple final candidates remain, external criteria are introduced to rank them. The proposed method is applied to a corpus of opinions for broadcasting collected in the experiment, and its accuracy in extracting expressions is compared with that of the conventional method. |
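To illustrate the alignment idea in the abstract, the following is a minimal sketch of pairwise global alignment by dynamic programming (the Needleman-Wunsch scheme) over token sequences. It is not the approximate multiple sequence alignment proposed in the paper, and it does not preserve alignment ambiguity. The function names `align` and `shared_tokens`, the scoring values, and the English example tokens are assumptions made for illustration; the paper works on morphologically analyzed Japanese text.

```python
from typing import List, Optional, Tuple

Pair = Tuple[Optional[str], Optional[str]]  # None marks a gap (illustrative convention)


def _sub(x: str, y: str, match: int, mismatch: int) -> int:
    """Substitution score for aligning token x with token y."""
    return match if x == y else mismatch


def align(a: List[str], b: List[str],
          match: int = 1, mismatch: int = -1, gap: int = -1) -> List[Pair]:
    """Globally align two token sequences; scores are assumed values."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + _sub(a[i - 1], b[j - 1], match, mismatch),
                score[i - 1][j] + gap,   # gap in b
                score[i][j - 1] + gap,   # gap in a
            )
    # Trace back one optimal path (ties are broken toward the diagonal,
    # so other equally good alignments are discarded here).
    pairs: List[Pair] = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and score[i][j]
                == score[i - 1][j - 1] + _sub(a[i - 1], b[j - 1], match, mismatch)):
            pairs.append((a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((a[i - 1], None))
            i -= 1
        else:
            pairs.append((None, b[j - 1]))
            j -= 1
    pairs.reverse()
    return pairs


def shared_tokens(pairs: List[Pair]) -> List[str]:
    """Tokens that are aligned to an identical token in the other sequence."""
    return [x for x, y in pairs if x is not None and x == y]


if __name__ == "__main__":
    s1 = ["I", "think", "the", "news", "was", "too", "long"]
    s2 = ["I", "think", "the", "drama", "was", "great"]
    print(shared_tokens(align(s1, s2)))
```

In this sketch, the tokens shared by the two aligned sequences form a common frame, while the positions that differ correspond to the topic-dependent parts. Extending this to many sequences, and keeping several candidate alignments instead of one, is where the exponential cost and the cutoffs mentioned in the abstract would come in.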
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Multiple Sequence Alignment / Dynamic Programming / Document Classification / Ambiguity of Alignment |
Reference Info. |
IEICE Tech. Rep., vol. 107, no. 480, NLC2007-91, pp. 25-30, Feb. 2008. |
Paper # |
NLC2007-91 |
Date of Issue |
2008-01-31 (NLC) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
NLC |
Conference Date |
2008-02-07 - 2008-02-08 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Yuzawa Culture Center |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
NLC |
Conference Code |
2008-02-NLC |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Extracting Opinions for Broadcasting using Multiple Sequence Alignment |
Sub Title (in English) |
|
Keyword(1) |
Multiple Sequence Alignment |
Keyword(2) |
Dynamic Programming |
Keyword(3) |
Document Classification |
Keyword(4) |
Ambiguity of Alignment |
1st Author's Name |
Takeshi S. Kobayakawa |
1st Author's Affiliation |
Japan Broadcasting Corporation (NHK) |
2nd Author's Name |
Naoto Kato |
2nd Author's Affiliation |
Japan Broadcasting Corporation (NHK) |
3rd Author's Name |
Jun'ichi Tsujii |
3rd Author's Affiliation |
Graduate School of Information Science and Technology, The University of Tokyo (Tokyo Univ.) |
Speaker |
Author-1 |
Date Time |
2008-02-07 15:00:00 |
Presentation Time |
30 minutes |
Registration for |
NLC |
Paper # |
NLC2007-91 |
Volume (vol) |
vol.107 |
Number (no) |
no.480 |
Page |
pp.25-30 |
#Pages |
6 |
Date of Issue |
2008-01-31 (NLC) |
|