Paper Abstract and Keywords |
Presentation |
2011-07-08 15:20
Analyzing Topics of Blogs based on Wikipedia as a Multilingual Knowledge Source Kensaku Makita, Daisuke Yokomoto, Hiroko Suzuki, Takehito Utsuro (Univ. of Tsukuba), Yasuhide Kawada (Navix), Tomohiro Fukuhara (AIST) NLC2011-18 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Given a search query, most existing search engines simply return a
ranked list of search results. However, it is often the case that
those search result documents consist of a mixture of documents that
are closely related to various sub-topics. This is also true for the
case of our previously developed framework of retrieving blog posts
which are closely related to a certain topic. In this paper, we
propose a framework of categorizing blog posts according to their
sub-topics, where, given a search query, those blog posts are
automatically collected from the blogosphere. In our framework, the
sub-topic of each blog post is identified by utilizing Wikipedia
entries as a knowledge source and each Wikipedia entry title is
considered as a sub-topic label. This paper especially presents
examples of applying the proposed framework to Japanese / Korean /
English blogospheres. Through those examples, we show that it becomes
much easier to quickly overview the distribution of sub-topics over the
whole blog posts collected with a certain search query. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
blog analysis / topic / Wikipedia / sub-topic categorization / facets / / / |
Reference Info. |
IEICE Tech. Rep., vol. 111, no. 119, NLC2011-18, pp. 95-100, July 2011. |
Paper # |
NLC2011-18 |
Date of Issue |
2011-06-30 (NLC) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
NLC2011-18 |
Conference Information |
Committee |
NLC |
Conference Date |
2011-07-07 - 2011-07-08 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
IBM Japan, Ltd. |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
The First Symposium on Text Mining |
Paper Information |
Registration To |
NLC |
Conference Code |
2011-07-NLC |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Analyzing Topics of Blogs based on Wikipedia as a Multilingual Knowledge Source |
Sub Title (in English) |
|
Keyword(1) |
blog analysis |
Keyword(2) |
topic |
Keyword(3) |
Wikipedia |
Keyword(4) |
sub-topic categorization |
Keyword(5) |
facets |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Kensaku Makita |
1st Author's Affiliation |
University of Tsukuba (Univ. of Tsukuba) |
2nd Author's Name |
Daisuke Yokomoto |
2nd Author's Affiliation |
University of Tsukuba (Univ. of Tsukuba) |
3rd Author's Name |
Hiroko Suzuki |
3rd Author's Affiliation |
University of Tsukuba (Univ. of Tsukuba) |
4th Author's Name |
Takehito Utsuro |
4th Author's Affiliation |
University of Tsukuba (Univ. of Tsukuba) |
5th Author's Name |
Yasuhide Kawada |
5th Author's Affiliation |
Navix Co., Ltd. (Navix) |
6th Author's Name |
Tomohiro Fukuhara |
6th Author's Affiliation |
National Institute of Advanced Industrial Science and Technology (AIST) |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2011-07-08 15:20:00 |
Presentation Time |
25 minutes |
Registration for |
NLC |
Paper # |
NLC2011-18 |
Volume (vol) |
vol.111 |
Number (no) |
no.119 |
Page |
pp.95-100 |
#Pages |
6 |
Date of Issue |
2011-06-30 (NLC) |
|