Paper: A Phrase-Based Statistical Model For SMS Text Normalization

Title A Phrase-Based Statistical Model For SMS Text Normalization
Year 2006

Short Messaging Service (SMS) texts behave quite differently from normal written texts and have some very special phenomena. To translate SMS texts, traditional approaches model such irregularities directly in Machine Translation (MT). However, such approaches suffer from a customization problem, as tremendous effort is required to adapt the language model of the existing translation system to handle SMS text style. We offer an alternative approach to resolve such irregularities by normalizing SMS texts before MT. In this paper, we view the task of SMS normalization as a translation problem from the SMS language to the English language, and we propose to adapt a phrase-based statistical MT model for the task. Evaluation by 5-fold cross validation on a parallel SMS normalized co...