Paper: Correcting Keyboard Layout Errors and Homoglyphs in Queries

ACL ID D14-1068
Title Correcting Keyboard Layout Errors and Homoglyphs in Queries
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2014
Authors

Keyboard layout errors and homoglyphs in cross-language queries impact our abil- ity to correctly interpret user informa- tion needs and offer relevant results. We present a machine learning approach to correcting these errors, based largely on character-level n-gram features. We demonstrate superior performance over rule-based methods, as well as a signif- icant reduction in the number of queries that yield null search results.