Paper: Exploring the use of word embeddings and random walks on Wikipedia for the CogAlex shared task

ACL ID W14-4704
Title Exploring the use of word embeddings and random walks on Wikipedia for the CogAlex shared task
Venue Workshop on Cognitive Aspects of the Lexicon
Session
Year 2014
Authors

In our participation in the task we wanted to test three different kinds of relatedness algorithms: one based on embeddings induced from corpora, another based on random walks on WordNet, and a third based on random walks on Wikipedia. All three perform similarly on noun relatedness datasets such as WordSim353, close to the highest reported values. Although the task definition gave examples of nouns, the training and test data were based on the Edinburgh Association Thesaurus, and around 50% of the target words were not nouns. The corpus-based algorithm performed much better than the other two methods on the training dataset, and was therefore the one submitted for the test.
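
As an illustration of what embedding-based relatedness can look like in practice, the following is a minimal sketch, not the system described in the paper. It assumes relatedness is scored as cosine similarity between word vectors; the names `cosine`, `rank_by_relatedness`, and `TOY_VECTORS`, as well as the three-dimensional vectors themselves, are hypothetical and chosen only for the example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # by convention, a zero vector is unrelated to everything
    return dot / (norm_u * norm_v)


def rank_by_relatedness(cue, candidates, vectors):
    """Rank candidate words by cosine similarity to the cue word.

    Candidates without a vector are skipped.
    """
    cue_vec = vectors[cue]
    scored = [
        (word, cosine(cue_vec, vectors[word]))
        for word in candidates
        if word in vectors and word != cue
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy vectors; real embeddings would be induced from a corpus.
TOY_VECTORS = {
    "dog": [0.9, 0.1, 0.0],
    "cat": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.2],
    "run": [0.5, 0.4, 0.6],
}

ranking = rank_by_relatedness("dog", ["cat", "car", "run"], TOY_VECTORS)
for word, score in ranking:
    print(f"{word}\t{score:.3f}")
```

Note that nothing in this scoring depends on part of speech: a verb such as "run" is ranked in the same way as a noun, provided it has a vector.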