Paper: Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction

ACL ID P11-1004
Title Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2011
Authors

This paper extends the training and tun- ing regime for phrase-based statistical ma- chine translation to obtain fluent trans- lations into morphologically complex lan- guages (we build an English to Finnish translation system). Our methods use unsupervised morphology induction. Un- like previous work we focus on morpho- logically productive phrase pairs – our decoder can combine morphemes across phrase boundaries. Morphemes in the tar- get language may not have a corresponding morpheme or word in the source language. Therefore, we propose a novel combina- tion of post-processing morphology pre- diction with morpheme-based translation. We show, using both automatic evaluation scores and linguistically motivated analy- ses of the output, that our methods out- perform previously proposed o...