Paper: Measuring The Similarity Between Compound Nouns In Different Languages Using Non-Parallel Corpora

ACL ID C02-1065
Title Measuring The Similarity Between Compound Nouns In Different Languages Using Non-Parallel Corpora
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2002
Authors
  • Takaaki Tanaka (NTT Communication Science Laboratories, Kyoto, Japan)

This paper presents a method that measures the similarity between compound nouns in different languages to locate translation equivalents from corpora. The method uses information from unrelated corpora in different languages that do not have to be parallel. This means that many corpora can be used. The method compares the contexts of target compound nouns and translation candidates at the word or semantic attribute level. In this paper, we show how this measuring method can be applied to select the best English translation candidate for Japanese compound nouns in more than 70% of the cases.
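
The word-level context comparison described above can be illustrated with a minimal sketch. The abstract does not specify the similarity measure, the context window, or how context words are mapped across languages, so everything here is an assumption made for illustration: a fixed token window, a small bilingual dictionary (`ja_en_dict`), cosine similarity over word counts, and all function names and toy sentences are hypothetical and not taken from the paper.

```python
from collections import Counter
from math import sqrt

def context_vector(sentences, target, window=2):
    """Count words within `window` tokens of each occurrence of `target` (a token tuple)."""
    counts = Counter()
    n = len(target)
    for tokens in sentences:
        for i in range(len(tokens) - n + 1):
            if tuple(tokens[i:i + n]) == target:
                counts.update(tokens[max(0, i - window):i])   # left context
                counts.update(tokens[i + n:i + n + window])   # right context
    return counts

def translate_vector(vec, dictionary):
    """Map source-language context words into the target language (assumed dictionary lookup)."""
    out = Counter()
    for word, count in vec.items():
        for translation in dictionary.get(word, []):
            out[translation] += count
    return out

def cosine(a, b):
    """Cosine similarity between two sparse count vectors (an assumed measure)."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def best_candidate(source_vec, candidate_vecs):
    """Return the candidate whose context vector is most similar to the source vector."""
    return max(candidate_vecs, key=lambda c: cosine(source_vec, candidate_vecs[c]))

# Toy, non-parallel corpora (romanized Japanese and English); purely illustrative.
ja_sentences = [["kare", "wa", "kikai", "honyaku", "o", "kenkyuu", "suru"]]
en_sentences = [
    ["he", "does", "research", "on", "machine", "translation", "systems"],
    ["the", "mechanical", "translation", "of", "gears", "is", "smooth"],
]
ja_en_dict = {"kenkyuu": ["research", "study"], "kare": ["he"]}

source_vec = translate_vector(
    context_vector(ja_sentences, ("kikai", "honyaku")), ja_en_dict
)
candidate_vecs = {
    cand: context_vector(en_sentences, cand)
    for cand in [("machine", "translation"), ("mechanical", "translation")]
}
chosen = best_candidate(source_vec, candidate_vecs)
```

In this toy setup the two corpora share no sentences, yet the candidate whose English contexts overlap with the translated Japanese contexts is preferred. A semantic-attribute variant would replace the words in each vector with category labels from a thesaurus before comparing, which is likewise an assumption about how that level might be realized.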