Paper: Morphological Analysis and Disambiguation for Dialectal Arabic

ACL ID N13-1044
Title Morphological Analysis and Disambiguation for Dialectal Arabic
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2013
Authors

The many differences between Dialectal Ara- bic and Modern Standard Arabic (MSA) pose a challenge to the majority of Arabic natural language processing tools, which are designed for MSA. In this paper, we retarget an exist- ing state-of-the-art MSA morphological tag- ger to Egyptian Arabic (ARZ). Our evalua- tion demonstrates that our ARZ morphology tagger outperforms its MSA variant on ARZ input in terms of accuracy in part-of-speech tagging, diacritization, lemmatization and to- kenization; and in terms of utility for ARZ-to- English statistical machine translation.