Paper: Complementing WordNet With Roget's And Corpus-Based Thesauri For Information Retrieval

ACL ID E99-1013
Title Complementing WordNet With Roget's And Corpus-Based Thesauri For Information Retrieval
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1999
Authors Rila Mandala, Takenobu Tokunaga, Hozumi Tanaka

This paper proposes a method to overcome the drawbacks of WordNet when applied to information retrieval by complementing it with Roget's thesaurus and corpus-derived thesauri. Words and relations that are not included in WordNet can be found in the corpus-derived thesauri. The effects of polysemy can be minimized with a weighting method that considers all query terms and all of the thesauri. Experimental results show that our method enhances information retrieval performance significantly.

Department of Computer Science, Tokyo Institute of Technology, 2-12-1 Oookayama, Meguro-Ku, Tokyo 152-8522, Japan
{rila,take,tanaka}@cs.titech.ac.jp

expansion (Voorhees, 1994; Smeaton and Berrut, 1995), computing lexical cohesion (Stairmand, 1997), word sense disambiguation (Voorhees, 1993), and so on...