Paper: Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction

ACL ID C98-2177
Title Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1998
Authors

Generating semantic lexicons semi- automatically could be a great time saver, relative to creating them by hand. In this paper, we present an algorithm for extracting potential entries for a category from an on-line corpus, based upon a small set of exemplars. Our algorithm finds more correct terms and fewer incorrect ones than previous work in this area. Additionally, the entries that are generated potentially provide broader coverage of the category than would occur to an indi- vidual coding them by hand. Our algorithm finds many terms not included within Wordnet (many more than previous algorithms), and could be viewed as an "enhancer" of existing broad-coverage resources.