Paper: A Framework for Translating SMS Messages

ACL ID C14-1092
Title A Framework for Translating SMS Messages
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2014

Short Messaging Service (SMS) has become a popular form of communication. While it is predominantly used for monolingual communication, it can be extremely useful for facilitating cross-lingual communication through statistical machine translation. In this work we present an application of statistical machine translation to SMS messages. We decouple the SMS transla- tion task into normalization followed by translation so that one can exploit existing bitext re- sources and present a novel unsupervised normalization approach using distributed representa- tion of words learned through neural networks. We describe several surrogate data that are good approximations to real SMS data feeds and use a hybrid translation approach using finite-state transducers. Both objective and subjective evalua...