Paper: Learning Dense Models of Query Similarity from User Click Logs

ACL ID N10-1071
Title Learning Dense Models of Query Similarity from User Click Logs
Venue Human Language Technologies
Session Main Conference
Year 2010
Authors

The goal of this work is to integrate query similarity metrics as features into a dense model that can be trained on large amounts of query log data, in order to rank query rewrites. We propose features that incorpo- rate various notions of syntactic and semantic similarity in a generalized edit distance frame- work. We use the implicit feedback of user clicks on search results as weak labels in train- ing linear ranking models on large data sets. We optimize different ranking objectives in a stochastic gradient descent framework. Our experiments show that a pairwise SVM ranker trained on multipartite rank levels outperforms other pairwise and listwise ranking methods under a variety of evaluation metrics.