Paper: Identifying Broken Plurals, Irregular Gender, and Rationality in Arabic Text

ACL ID E12-1069
Title Identifying Broken Plurals, Irregular Gender, and Rationality in Arabic Text
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2012
Authors

Arabic morphology is complex, partly be- cause of its richness, and partly because of common irregular word forms, such as broken plurals (which resemble singular nouns), and nouns with irregular gender (feminine nouns that look masculine and vice versa). In addition, Arabic morpho- syntactic agreement interacts with the lex- ical semantic feature of rationality, which has no morphological realization. In this paper, we present a series of experiments on the automatic prediction of the latent linguistic features of functional gender and number, and rationality in Arabic. We com- pare two techniques, using simple maxi- mum likelihood (MLE) with back-off and a support vector machine based sequence tagger (Yamcha). We study a number of orthographic, morphological and syntactic learning featur...