Paper: Arabic Language Modeling with Finite State Transducers

ACL ID P08-3007
Title Arabic Language Modeling with Finite State Transducers
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2008
Authors

In morphologically rich languages such as Arabic, the abundance of word forms result- ingfromincreasedmorphemecombinationsis significantly greater than for languages with fewer inflected forms (Kirchhoff et al., 2006). Thisexacerbatestheout-of-vocabulary(OOV) problem. Test set words are more likely to be unknown, limiting the effectiveness of the model. The goal of this study is to use the regularities of Arabic inflectional morphology to reduce the OOV problem in that language. We hope that success in this task will result in a decrease in word error rate in Arabic auto- matic speech recognition.