Paper: Context-Dependent Multilingual Lexical Lookup for Under-Resourced Languages

ACL ID P13-2053
Title Context-Dependent Multilingual Lexical Lookup for Under-Resourced Languages
Venue Annual Meeting of the Association of Computational Linguistics
Session Short Paper
Year 2013
Authors

Current approaches for word sense dis- ambiguation and translation selection typ- ically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilin- gual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English?Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean recip- rocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the contex...