Paper: Finding Parts In Very Large Corpora

ACL ID P99-1008
Title Finding Parts In Very Large Corpora
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1999
Authors

We present a method for extracting parts of objects from wholes (e.g. "speedometer" from "car"). Given a very large corpus, our method finds part words with 55% accuracy for the top 50 words as ranked by the system. The part list could be scanned by an end-user and added to an existing ontology (such as WordNet), or used as part of a rough semantic lexicon.