Presentation | 2011-12-19 Extraction of new abbreviated words using Crowdsourcing System Toshihiko SAKAI, Masayuki ASHIKAWA, Sachio HIROKAWA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | New words and abbreviated words are being born every day in CGM (consumer generated media) on the Web, such as Facebook and Twitter. Those words are not in the standard dictionaries and cause many difficulties in morphological analysis. This paper proposes a method to increase vocabularies from Twitter using Crowdsourcing. At the first stage, unknown words are chosen as candidates of new abbreviated words using a standard morphological analysis. At the second stage, Crowdsourcing System is used to determine if a word is an abbreviated word. Couwdsourcing System is used at the third stage to obtain the correct reading and the proper word. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Crowdsourcing / Abbreviated Words / Unknown Words / New Words / Unregistered Words |
Paper # | NLC2011-36,SP2011-81 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2011/12/12(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Extraction of new abbreviated words using Crowdsourcing System |
Sub Title (in English) | |
Keyword(1) | Crowdsourcing |
Keyword(2) | Abbreviated Words |
Keyword(3) | Unknown Words |
Keyword(4) | New Words |
Keyword(5) | Unregistered Words |
1st Author's Name | Toshihiko SAKAI |
1st Author's Affiliation | Graduate School of Information Science and Electrical Engineering, Kyushu University() |
2nd Author's Name | Masayuki ASHIKAWA |
2nd Author's Affiliation | Toshiba Research and Development Center |
3rd Author's Name | Sachio HIROKAWA |
3rd Author's Affiliation | Research Institute for Information Technology, Kyushu University |
Date | 2011-12-19 |
Paper # | NLC2011-36,SP2011-81 |
Volume (vol) | vol.111 |
Number (no) | 364 |
Page | pp.pp.- |
#Pages | 5 |
Date of Issue |