翻訳メモリとコーパスを用いた学習に基づく訳語選択

内元 清貴; 関根 聡; 村田 真樹; 井佐原 均

Presentation	2001/10/10 Word Translation Based on Machine Learning Models Using Translation Memory and Corpora Kiyotaka UCHIMOTO, Satoshi SEKINE, Masaki MURATA, Hiroshi ISAHARA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	The second contest on word sensedisambiguation, SENSEVAL-2, was held in Spring, 2001. It consists of several tasks in various languages. In this paper, we describe our system that is used for one of these tasks: the Japanese translation task. In this task, senses of a word are defined in terms of the word's translations. Given an input sentence and a target word in the sentence, our system first estimates the similarity between the input sentence and parallel example sets called translation memory. It then selects an appropriate translation of the target word by using the example set with the highest similarity. The similarity is calculated using dynamic programming and a machine learning model which uses the following features: similarity of a string, words to the left and to the right of the target word in the input sentence, content words in the input sentence and their translations, and cooccurrence of content words in bilingual and monolingual corpora in English and Japanese. Our system achieves an accuracy of 63.4%, finishing the contest in third place among nine systems developed by seven groups.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Translation memory / Corpus / Similarity / Machine learning / Word translation
Paper #	NLC 2001-41
Date of Issue

Conference Information
Committee	NLC
Conference Date	2001/10/10(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Word Translation Based on Machine Learning Models Using Translation Memory and Corpora
Sub Title (in English)
Keyword(1)	Translation memory
Keyword(2)	Corpus
Keyword(3)	Similarity
Keyword(4)	Machine learning
Keyword(5)	Word translation
1st Author's Name	Kiyotaka UCHIMOTO
1st Author's Affiliation	Communications Research Laboratory()
2nd Author's Name	Satoshi SEKINE
2nd Author's Affiliation	New York University
3rd Author's Name	Masaki MURATA
3rd Author's Affiliation	Communications Research Laboratory
4th Author's Name	Hiroshi ISAHARA
4th Author's Affiliation	Communications Research Laboratory
Date	2001/10/10
Paper #	NLC 2001-41
Volume (vol)	vol.101
Number (no)	351
Page	pp.pp.-
#Pages	8
Date of Issue