Paper: Word Alignment For Languages With Scarce Resources Using Bilingual Corpora Of Other Language Pairs

ACL ID P06-2112
Title Word Alignment For Languages With Scarce Resources Using Bilingual Corpora Of Other Language Pairs
Venue Annual Meeting of the Association of Computational Linguistics
Session Poster Session
Year 2006
Authors

This paper proposes an approach to im- prove word alignment for languages with scarce resources using bilingual corpora of other language pairs. To perform word alignment between languages L1 and L2, we introduce a third language L3. Al- though only small amounts of bilingual data are available for the desired lan- guage pair L1-L2, large-scale bilingual corpora in L1-L3 and L2-L3 are available. Based on these two additional corpora and with L3 as the pivot language, we build a word alignment model for L1 and L2. This approach can build a word alignment model for two languages even if no bilingual corpus is available in this language pair. In addition, we build an- other word alignment model for L1 and L2 using the small L1-L2 bilingual cor- pus. Then we interpolate the above two models to...