Presentation | 2010-12-06 Bilingual Terminology Extraction from Wikipedia Combined with Web Search Engine Results Maike ERDMANN, Kotaro NAKAYAMA, Takahiro HARA, Shojiro NISHIO, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Wikipedia is a large-scale multilingual encyclopedia and therefore a promising resource for bilingual terminology extraction. However, current approaches translate only terms that are represented by a Wikipedia article. We believe that even if the article is missing, Wikipedia still contains valuable translation information. We propose a method that analyzes Wikipedia article texts and links to extract translation candidates, and uses Web search engine results to evaluate them. An experiment using terms extracted from a Japanese-English medical dictionary shows that our approach achieves high accuracy and coverage. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Bilingual Terminology / Wikipedia Mining / Web Mining |
Paper # | DE2010-25 |
Date of Issue |
Conference Information | |
Committee | DE |
---|---|
Conference Date | 2010/11/29(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Data Engineering (DE) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Bilingual Terminology Extraction from Wikipedia Combined with Web Search Engine Results |
Sub Title (in English) | |
Keyword(1) | Bilingual Terminology |
Keyword(2) | Wikipedia Mining |
Keyword(3) | Web Mining |
1st Author's Name | Maike ERDMANN |
1st Author's Affiliation | Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University() |
2nd Author's Name | Kotaro NAKAYAMA |
2nd Author's Affiliation | Center for Knowledge Structuring, The University of Tokyo |
3rd Author's Name | Takahiro HARA |
3rd Author's Affiliation | Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University |
4th Author's Name | Shojiro NISHIO |
4th Author's Affiliation | Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University |
Date | 2010-12-06 |
Paper # | DE2010-25 |
Volume (vol) | vol.110 |
Number (no) | 328 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |