Paper: A global model for joint lemmatization and part-of-speech prediction

ACL ID P09-1055
Title A global model for joint lemmatization and part-of-speech prediction
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2009
Authors

We present a global joint model for lemmatization and part-of-speech prediction. Using only morphological lexicons and unlabeled data, we learn a partially-supervised part-of-speech tagger and a lemmatizer which are combined using features on a dynamically linked dependency structure of words. We evaluate our model on English, Bulgarian, Czech, and Slovene, and demonstrate substantial improvements over both a direct transduction approach to lemmatization and a pipelined approach, which predicts part-of-speech tags before lemmatization.