Paper: Re-training Monolingual Parser Bilingually for Syntactic SMT

ACL ID D12-1078
Title Re-training Monolingual Parser Bilingually for Syntactic SMT
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2012

The training of most syntactic SMT approaches involves two essential components, word alignment and monolingual parser. In the current state of the art these two components are mutually independent, thus causing problems like lack of rule generalization, and violation of syntactic correspondence in translation rules. In this paper, we propose two ways of re-training monolingual parser with the target of maximizing the consistency between parse trees and alignment matrices. One is targeted self-training with a simple evaluation function; the other is based on training data selection from forced alignment of bilingual data. We also propose an auxiliary method for boosting alignment quality, by symmetrizing alignment matrices with respect to parse trees. The best combination ...