Presentation 2011-12-19
Extraction of new abbreviated words using Crowdsourcing System
Toshihiko SAKAI, Masayuki ASHIKAWA, Sachio HIROKAWA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) New words and abbreviated words are being born every day in CGM (consumer generated media) on the Web, such as Facebook and Twitter. Those words are not in the standard dictionaries and cause many difficulties in morphological analysis. This paper proposes a method to increase vocabularies from Twitter using Crowdsourcing. At the first stage, unknown words are chosen as candidates of new abbreviated words using a standard morphological analysis. At the second stage, Crowdsourcing System is used to determine if a word is an abbreviated word. Couwdsourcing System is used at the third stage to obtain the correct reading and the proper word.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Crowdsourcing / Abbreviated Words / Unknown Words / New Words / Unregistered Words
Paper # NLC2011-36,SP2011-81
Date of Issue

Conference Information
Committee NLC
Conference Date 2011/12/12(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Extraction of new abbreviated words using Crowdsourcing System
Sub Title (in English)
Keyword(1) Crowdsourcing
Keyword(2) Abbreviated Words
Keyword(3) Unknown Words
Keyword(4) New Words
Keyword(5) Unregistered Words
1st Author's Name Toshihiko SAKAI
1st Author's Affiliation Graduate School of Information Science and Electrical Engineering, Kyushu University()
2nd Author's Name Masayuki ASHIKAWA
2nd Author's Affiliation Toshiba Research and Development Center
3rd Author's Name Sachio HIROKAWA
3rd Author's Affiliation Research Institute for Information Technology, Kyushu University
Date 2011-12-19
Paper # NLC2011-36,SP2011-81
Volume (vol) vol.111
Number (no) 364
Page pp.pp.-
#Pages 5
Date of Issue