Paper: A Phrase-Based Statistical Model For SMS Text Normalization

ACL ID P06-2005
Title A Phrase-Based Statistical Model For SMS Text Normalization
Venue Annual Meeting of the Association for Computational Linguistics
Session Poster Session
Year 2006

Short Messaging Service (SMS) texts behave quite differently from normal written texts and exhibit some very special phenomena. To translate SMS texts, traditional approaches model such irregularities directly in Machine Translation (MT). However, such approaches suffer from a customization problem, as tremendous effort is required to adapt the language model of the existing translation system to handle the SMS text style. We offer an alternative approach that resolves such irregularities by normalizing SMS texts before MT. In this paper, we view the task of SMS normalization as a translation problem from the SMS language to the English language, and we propose to adapt a phrase-based statistical MT model for the task. Evaluation by 5-fold cross validation on a parallel SMS normalized co...