Presentation 2010-12-06
Bilingual Terminology Extraction from Wikipedia Combined with Web Search Engine Results
Maike ERDMANN, Kotaro NAKAYAMA, Takahiro HARA, Shojiro NISHIO,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Wikipedia is a large-scale multilingual encyclopedia and therefore a promising resource for bilingual terminology extraction. However, current approaches translate only terms that are represented by a Wikipedia article. We believe that even if the article is missing, Wikipedia still contains valuable translation information. We propose a method that analyzes Wikipedia article texts and links to extract translation candidates, and uses Web search engine results to evaluate them. An experiment using terms extracted from a Japanese-English medical dictionary shows that our approach achieves high accuracy and coverage.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Bilingual Terminology / Wikipedia Mining / Web Mining
Paper # DE2010-25
Date of Issue

Conference Information
Committee DE
Conference Date 2010/11/29(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Bilingual Terminology Extraction from Wikipedia Combined with Web Search Engine Results
Sub Title (in English)
Keyword(1) Bilingual Terminology
Keyword(2) Wikipedia Mining
Keyword(3) Web Mining
1st Author's Name Maike ERDMANN
1st Author's Affiliation Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University()
2nd Author's Name Kotaro NAKAYAMA
2nd Author's Affiliation Center for Knowledge Structuring, The University of Tokyo
3rd Author's Name Takahiro HARA
3rd Author's Affiliation Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University
4th Author's Name Shojiro NISHIO
4th Author's Affiliation Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University
Date 2010-12-06
Paper # DE2010-25
Volume (vol) vol.110
Number (no) 328
Page pp.pp.-
#Pages 6
Date of Issue