Paper: Wikipedia as Sense Inventory to Improve Diversity in Web Search Results

ACL ID P10-1138
Title Wikipedia as Sense Inventory to Improve Diversity in Web Search Results
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2010
Authors

Is it possible to use sense inventories to improve Web search results diversity for one-word queries? To answer this question, we focus on two broad-coverage lexical resources of a different nature: WordNet, as a de facto standard used in Word Sense Disambiguation experiments; and Wikipedia, as a large-coverage, updated encyclopaedic resource which may have a better coverage of relevant senses in Web pages. Our results indicate that (i) Wikipedia has a much better coverage of search results, (ii) the distribution of senses in search results can be estimated using the internal graph structure of Wikipedia and the relative number of visits received by each sense in Wikipedia, and (iii) associating Web pages to Wikipedia senses with simple and efficient algorithms, we can prod...