Paper: Word Sense Induction: Triplet-Based Clustering And Automatic Evaluation

ACL ID E06-1018
Title Word Sense Induction: Triplet-Based Clustering And Automatic Evaluation
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2006
Authors

In this paper a novel solution to auto- matic and unsupervised word sense induc- tion (WSI) is introduced. It represents an instantiation of the ‘one sense per colloca- tion’ observation (Gale et al. , 1992). Like most existing approaches it utilizes clus- tering of word co-occurrences. This ap- proach differs from other approaches to WSI in that it enhances the effect of the one sense per collocation observation by using triplets of words instead of pairs. The combination with a two-step cluster- ing process using sentence co-occurrences as features allows for accurate results. Ad- ditionally, a novel and likewise automatic and unsupervised evaluation method in- spired by Sch¨utze’s (1992) idea of evalu- ation of word sense disambiguation algo- rithms is employed. Offering advantag...