Paper: Extracting Multiword Translations from Aligned Comparable Documents

ACL ID W14-1016
Title Extracting Multiword Translations from Aligned Comparable Documents
Venue Workshop on Hybrid Approaches to Translation
Session
Year 2014
Authors

Most previous attempts to identify trans- lations of multiword expressions using comparable corpora relied on dictionaries of single words. The translation of a mul- tiword was then constructed from the translations of its components. In con- trast, in this work we try to determine the translation of a multiword unit by analyz- ing its contextual behaviour in aligned comparable documents, thereby not pre- supposing any given dictionary. Whereas with this method translation results for single words are rather good, the results for multiword units are considerably worse. This is an indication that the type of multiword expressions considered here is too infrequent to provide a sufficient amount of contextual information. Thus indirectly it is confirmed that it should make sense...