Paper: Inducing Sentence Structure from Parallel Corpora for Reordering

ACL ID D11-1018
Title Inducing Sentence Structure from Parallel Corpora for Reordering
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2011
Authors

When translating among languages that differ substantially in word order, machine transla- tion (MT) systems benefit from syntactic pre- ordering—an approach that uses features from a syntactic parse to permute source words into a target-language-like order. This paper presents a method for inducing parse trees au- tomatically from a parallel corpus, instead of using a supervised parser trained on a tree- bank. These induced parses are used to pre- order source sentences. We demonstrate that our induced parser is effective: it not only improves a state-of-the-art phrase-based sys- tem with integrated reordering, but also ap- proaches the performance of a recent pre- ordering method based on a supervised parser. These results show that the syntactic structure which is relevant to MT pre-o...