Paper: Part of Speech Tagging for French Social Media Data

ACL ID C14-1166
Title Part of Speech Tagging for French Social Media Data
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2014

In the context of Social Media Analytics, Natural Language Processing tools face new chal- lenges on on-line conversational text, such as microblogs, chat, or text messages, because of the specificity of the language used in these channels. This work addresses the problem of Part- Of-Speech tagging (initially for French but also for English) on noisy language usage from the popular social media services like Twitter, Facebook and forums. We employ a linear-chain con- ditional random fields (CRFs) model, enriched with several morphological, orthographic, lexical and large-scale word clustering features. Our experiments used different feature configurations to train the model. We achieved a higher tagging performance with these features, compared to baseline results on French social media ba...