Paper: A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean

ACL ID P07-2016
Title A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean
Venue Annual Meeting of the Association of Computational Linguistics
Session System Demonstration
Year 2007
Authors

This paper presents noisy-channel based Korean preprocessor system, which cor- rects word spacing and typographical errors. The proposed algorithm corrects both er- rors simultaneously. Using Eojeol transi- tion pattern dictionary and statistical data such as Eumjeol n-gram and Jaso transition probabilities, the algorithm minimizes the usage of huge word dictionaries.