Paper: Linguistic Regularities in Sparse and Explicit Word Representations

ACL ID W14-1618
Title Linguistic Regularities in Sparse and Explicit Word Representations
Venue International Conference on Computational Natural Language Learning
Session
Year 2014
Authors

Recent work has shown that neural- embedded word representations capture many relational similarities, which can be recovered by means of vector arithmetic in the embedded space. We show that Mikolov et al.?s method of first adding and subtracting word vectors, and then searching for a word similar to the re- sult, is equivalent to searching for a word that maximizes a linear combination of three pairwise word similarities. Based on this observation, we suggest an improved method of recovering relational similar- ities, improving the state-of-the-art re- sults on two recent word-analogy datasets. Moreover, we demonstrate that analogy recovery is not restricted to neural word embeddings, and that a similar amount of relational similarities can be recovered from traditional distributional wo...