Paper: Syntactic Models for Structural Word Insertion and Deletion during Translation

ACL ID D08-1077
Title Syntactic Models for Structural Word Insertion and Deletion during Translation
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2008
Authors

An important problem in translation neglected by most recent statistical machine translation systems is insertion and deletion of words, such as function words, motivated by linguistic structure rather than adjacent lexical context. Phrasal and hierarchical systems can only insert or delete words in the context of a larger phrase or rule. While this may suffice when translating in-domain, it performs poorly when trying to translate broad domains such as web text. Various syntactic approaches have been proposed that begin to address this problem by learning lexicalized and unlexicalized rules. Among these, the treelet approach uses unlexicalized order templates to model ordering separately from lexical choice. We introduce an extension to the latter that allows for struct...