Paper: Context-Based Morphological Disambiguation With Random Fields

ACL ID H05-1060
Title Context-Based Morphological Disambiguation With Random Fields
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2005

Finite-state approaches have been highly successful at describing the morphological processes of many languages. Such approaches have largely focused on modeling the phone- or character-level processes that generate candidate lexical types, rather than tokens in context. For the full analysis of words in context, disambiguation is also required (Hakkani-Tür et al., 2000; Hajič et al., 2001). In this paper, we apply a novel source-channel model to the problem of morphological disambiguation (segmentation into morphemes, lemmatization, and POS tagging) for concatenative, templatic, and inflectional languages. The channel model exploits an existing morphological dictionary, constraining each word’s analysis to be linguistically valid. The source model is a factored, conditionall...