ACL Anthology Network (All About NLP) (beta) The Association Of Computational Linguistics Anthology Network |
ACL ID | W13-1718 |
---|---|
Title | Cognate and Misspelling Features for Natural Language Identification |
Venue | Innovative Use of NLP for Building Educational Applications |
Session | |
Year | 2013 |
Authors |
We apply Support Vector Machines to differ- entiate between 11 native languages in the 2013 Native Language Identification Shared Task. We expand a set of common language identification features to include cognate inter- ference and spelling mistakes. Our best results are obtained with a classifier which includes both the cognate and the misspelling features, as well as word unigrams, word bigrams, char- acter bigrams, and syntax production rules.