Paper: Part-of-Speech Tagging for English-Spanish Code-Switched Text

ACL ID D08-1110
Title Part-of-Speech Tagging for English-Spanish Code-Switched Text
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2008
Authors

Code-switching is an interesting linguistic phenomenon commonly observed in highly bilingual communities. It consists of mixing languages in the same conversational event. This paper presents results on Part-of-Speech tagging Spanish-English code-switched dis- course. We explore different approaches to exploit existing resources for both languages that range from simple heuristics, to language identification, to machine learning. The best results are achieved by training a machine learning algorithm with features that combine the output of an English and a Spanish Part- of-Speech tagger.