Paper: Identifying Bad Semantic Neighbors for Improving Distributional Thesauri

ACL ID P13-1055
Title Identifying Bad Semantic Neighbors for Improving Distributional Thesauri
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2013
Authors

Distributional thesauri are now widely used in a large number of Natural Lan- guage Processing tasks. However, they are far from containing only interesting semantic relations. As a consequence, improving such thesaurus is an impor- tant issue that is mainly tackled indirectly through the improvement of semantic sim- ilarity measures. In this article, we pro- pose a more direct approach focusing on the identification of the neighbors of a thesaurus entry that are not semantically linked to this entry. This identification re- lies on a discriminative classifier trained from unsupervised selected examples for building a distributional model of the entry in texts. Its bad neighbors are found by ap- plying this classifier to a representative set of occurrences of each of these neighbors. We ev...