Paper: Evaluating Cross-Language Annotation Transfer In The MultiSemCor Corpus

ACL ID C04-1053
Title Evaluating Cross-Language Annotation Transfer In The MultiSemCor Corpus
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2004
Authors

In this paper we illustrate and evaluate an approach to the creation of high quality linguistically annotated resources based on the exploitation of aligned parallel corpora. This approach is based on the assumption that if a text in one language has been annotated and its translation has not, annotations can be transferred from the source text to the target using word alignment as a bridge. The transfer approach has been tested in the creation of the MultiSemCor corpus, an English/Italian parallel corpus created on the basis of the English SemCor corpus. In MultiSemCor texts are aligned at the word level and semantically annotated with a shared inventory of senses. We present some experiments carried out to evaluate the different steps involved in the methodology. The results of the evalu...