Paper: Lattice-Based Word Identification In CLARE

ACL ID P92-1021
Title Lattice-Based Word Identification In CLARE
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1992

I argue that because of spelling and typing errors and other properties of typed text, the identification of words and word boundaries in general requires syntactic and semantic knowledge. A lattice representation is therefore appropriate for lexical analysis. I show how the use of such a representation in the CLARE system allows different kinds of hypothesis about word identity to be integrated in a uniform framework. I then describe a quantitative evaluation of CLARE's performance on a set of sentences into which typographic errors have been introduced. The results show that syntax and semantics can be applied as powerful sources of constraint on the possible corrections for misspelled words.
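
The idea of a word lattice can be illustrated with a minimal sketch. It is not CLARE's implementation: the toy lexicon, the single-edit correction rule, and the names `one_edit_apart`, `candidates`, `build_lattice`, and `paths` are all assumptions made for illustration. Vertices are token boundaries, and each edge carries one hypothesis about the word spanning those boundaries, either a spelling correction for a single token or a merge of two tokens split by a stray space.

```python
from collections import defaultdict

# Hypothetical toy lexicon, assumed for illustration only.
LEXICON = {"show", "the", "them", "flights", "to", "boston", "me"}


def one_edit_apart(a, b):
    """True if a and b differ by one insertion, deletion, substitution,
    or transposition of adjacent characters."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) == len(b):
        diffs = [i for i in range(len(a)) if a[i] != b[i]]
        if len(diffs) == 1:
            return True  # single substitution
        return (len(diffs) == 2 and diffs[1] == diffs[0] + 1
                and a[diffs[0]] == b[diffs[1]]
                and a[diffs[1]] == b[diffs[0]])  # adjacent transposition
    if len(a) > len(b):
        a, b = b, a  # make a the shorter string
    # Deleting one character from the longer string must give the shorter.
    return any(b[:i] + b[i + 1:] == a for i in range(len(b)))


def candidates(token):
    """Word-identity hypotheses for one token: the token itself if it is
    in the lexicon, otherwise every lexicon word one edit away."""
    if token in LEXICON:
        return [token]
    return sorted(w for w in LEXICON if one_edit_apart(token, w))


def build_lattice(tokens):
    """Map each start vertex to a list of (end_vertex, word) edges."""
    edges = defaultdict(list)
    for i, tok in enumerate(tokens):
        for word in candidates(tok):
            edges[i].append((i + 1, word))
        # Word-boundary hypothesis: two adjacent tokens may be one word.
        if i + 1 < len(tokens) and tok + tokens[i + 1] in LEXICON:
            edges[i].append((i + 2, tok + tokens[i + 1]))
    return edges


def paths(edges, start, end):
    """Yield every word sequence that spans the lattice from start to end."""
    if start == end:
        yield []
        return
    for nxt, word in edges.get(start, []):
        for rest in paths(edges, nxt, end):
            yield [word] + rest


if __name__ == "__main__":
    tokens = ["show", "te", "fli", "ghts"]
    for reading in paths(build_lattice(tokens), 0, len(tokens)):
        print(" ".join(reading))
```

For the input `show te fli ghts`, the sketch yields three readings that differ only in the correction chosen for `te` ("me", "the", or "to"), while `fli ghts` is recovered as "flights" through a boundary-merging edge. The sketch only enumerates paths. Choosing among them is the step where, according to the abstract, syntactic and semantic knowledge is applied, and that step is not modelled here.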