Paper: A Report on the First Native Language Identification Shared Task

ACL ID W13-1706
Title A Report on the First Native Language Identification Shared Task
Venue Innovative Use of NLP for Building Educational Applications
Session
Year 2013
Authors

Native Language Identification, or NLI, is the task of automatically classifying the L1 of a writer based solely on his or her essay written in another language. This problem area has seen a spike in interest in recent years as it can have an impact on educational applications tailored towards non-native speakers of a language, as well as authorship profiling. While there has been a growing body of work in NLI, it has been difficult to compare methodologies because of the different approaches to pre-processing the data, different sets of languages identified, and different splits of the data used. In this shared task, the first ever for Native Language Identification, we sought to address the above issues by providing a large corpus designed specifically for NLI, in addition ...