Paper: Improvements In Phrase-Based Statistical Machine Translation

ACL ID N04-1033
Title Improvements In Phrase-Based Statistical Machine Translation
Venue Human Language Technologies
Session Main Conference
Year 2004
Authors

In statistical machine translation, the currently best performing systems are based in some way on phrases or word groups. We describe the baseline phrase-based translation system and various refinements. We describe a highly ef- ficient monotone search algorithm with a com- plexity linear in the input sentence length. We present translation results for three tasks: Verb- mobil, Xerox and the Canadian Hansards. For the Xerox task, it takes less than 7 seconds to translate the whole test set consisting of more than 10K words. The translation results for the Xerox and Canadian Hansards task are very promising. The system even outperforms the alignment template system.