Paper: Aggressive Morphology For Robust Lexical Coverage

ACL ID A00-1030
Title Aggressive Morphology For Robust Lexical Coverage
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2000
Authors

This paper describes an approach to providing lex- ical information for natural language processing in unrestricted domains. A system of approximately 1200 morphological rules is used to extend a core lex- icon of 39,000 words to provide lexical coverage that exceeds that of a lexicon of 80,000 words or 150,000 word forms. The morphological system is described, and lexical coverage is evaluated for random words chosen from a previously unanalyzed corpus. 1 Motivation Many applications of natural language processing have a need for a large vocabulary lexicon. How- ever, no matter how large a lexicon one starts with, most applications will encounter terms that are not covered. This paper describes an approach to the lexicon problem that emphasizes recognition of mor- phological structure in ...