Presentation 2013-09-12
An approach for extracting concerned area of Twitter users by category information in Wikipedia
Yinjun HU, Yasuo TANIDA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Recently, Twitter becomes widely spread and owns a large number of users what attracts much attention for text mining research. In this paper, we propose a method of extracting concerned area of Twitter user using category information from Wikipedia. Twitter is a huge information repository, however, there is much word-sense ambiguation in Twitter data. Our approach is using the disambiguation pages of Wikipedia to do the word-sense disambiguation for the Twitter data. Finally, we categorized the disambiguated Twitter data by Wikipedia category information and treated the categorized data as the concerned area of the Twitter user.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Tweets / Concerned Area / Text mining / Wikipedia / Category
Paper # NLC2013-17
Date of Issue

Conference Information
Committee NLC
Conference Date 2013/9/5(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) An approach for extracting concerned area of Twitter users by category information in Wikipedia
Sub Title (in English)
Keyword(1) Tweets
Keyword(2) Concerned Area
Keyword(3) Text mining
Keyword(4) Wikipedia
Keyword(5) Category
1st Author's Name Yinjun HU
1st Author's Affiliation Synergy Marketing, Inc.()
2nd Author's Name Yasuo TANIDA
2nd Author's Affiliation Synergy Marketing, Inc.
Date 2013-09-12
Paper # NLC2013-17
Volume (vol) vol.113
Number (no) 213
Page pp.pp.-
#Pages 5
Date of Issue