Wikipediaのカテゴリ情報を用いたTwitterユーザの関心分野の抽出(評判・評価,第3回テキストマイニング・シンポジウム)

Presentation	2013-09-12 An approach for extracting concerned area of Twitter users by category information in Wikipedia Yinjun HU, Yasuo TANIDA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Recently, Twitter becomes widely spread and owns a large number of users what attracts much attention for text mining research. In this paper, we propose a method of extracting concerned area of Twitter user using category information from Wikipedia. Twitter is a huge information repository, however, there is much word-sense ambiguation in Twitter data. Our approach is using the disambiguation pages of Wikipedia to do the word-sense disambiguation for the Twitter data. Finally, we categorized the disambiguated Twitter data by Wikipedia category information and treated the categorized data as the concerned area of the Twitter user.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Tweets / Concerned Area / Text mining / Wikipedia / Category
Paper #	NLC2013-17
Date of Issue

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	An approach for extracting concerned area of Twitter users by category information in Wikipedia
Sub Title (in English)
Keyword(1)	Tweets
Keyword(2)	Concerned Area
Keyword(3)	Text mining
Keyword(4)	Wikipedia
Keyword(5)	Category
1st Author's Name	Yinjun HU
1st Author's Affiliation	Synergy Marketing, Inc.()
2nd Author's Name	Yasuo TANIDA
2nd Author's Affiliation	Synergy Marketing, Inc.
Date	2013-09-12
Paper #	NLC2013-17
Volume (vol)	vol.113
Number (no)	213
Page	pp.pp.-
#Pages	5
Date of Issue