Paper: Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model

ACL ID D14-1015
Title Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2014
Authors

We investigate how to improve bilingual embedding which has been successfully used as a feature in phrase-based sta- tistical machine translation (SMT). De- spite bilingual embedding?s success, the contextual information, which is of criti- cal importance to translation quality, was ignored in previous work. To employ the contextual information, we propose a simple and memory-efficient model for learning bilingual embedding, taking both the source phrase and context around the phrase into account. Bilingual translation scores generated from our proposed bilin- gual embedding model are used as features in our SMT system. Experimental results show that the proposed method achieves significant improvements on large-scale Chinese-English translation task.