Paper: Generating Confusion Sets for Context-Sensitive Error Correction

ACL ID D10-1094
Title Generating Confusion Sets for Context-Sensitive Error Correction
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2010
Authors

In this paper, we consider the problem of gen- erating candidate corrections for the task of correcting errors in text. We focus on the task of correcting errors in preposition usage made by non-native English speakers, using discriminative classifiers. The standard ap- proach to the problem assumes that the set of candidate corrections for a preposition con- sists of all preposition choices participating in the task. We determine likely preposition confusions using an annotated corpus of non- native text and use this knowledge to produce smaller sets of candidates. We propose several methods of restricting candidate sets. These methods exclude candi- date prepositions that are not observed as valid corrections in the annotated corpus and take into account the likelihood of each preposi- t...