Paper: Finding Content-Bearing Terms Using Term Similarities

ACL ID E99-1034
Title Finding Content-Bearing Terms Using Term Similarities
Venue Annual Meeting of the European Chapter of the Association for Computational Linguistics
Session Main Conference
Year 1999

This paper explores the issue of using different co-occurrence similarities between terms for separating query terms that are useful for retrieval from those that are harmful. The hypothesis under examination is that useful terms tend to be more similar to each other than to other query terms. Preliminary experiments with similarities computed using first-order and second-order co-occurrence seem to confirm the hypothesis. Term similarities could then be used for determining which query terms are useful and best reflect the user's information need. A possible application would be to use this source of evidence for tuning the weights of the query terms.
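
The hypothesis in the abstract can be illustrated with a small sketch. The abstract does not say which similarity measures or which corpus were used, so everything below is an assumption made for illustration: the Dice coefficient over document sets stands in for first-order co-occurrence, the cosine between co-occurrence vectors stands in for second-order co-occurrence, and the toy documents, the query, and all function names are hypothetical. Each query term is scored by its mean similarity to the other query terms; under the hypothesis, a term with a low score would be a candidate for a lower query weight.

```python
from collections import defaultdict
from math import sqrt


def build_postings(docs):
    """Map each term to the set of indices of the documents containing it."""
    postings = defaultdict(set)
    for i, doc in enumerate(docs):
        for term in doc.lower().split():
            postings[term].add(i)
    return postings


def first_order(postings, a, b):
    """First-order similarity (assumed measure: Dice over document sets)."""
    da, db = postings.get(a, set()), postings.get(b, set())
    if not da or not db:
        return 0.0
    return 2 * len(da & db) / (len(da) + len(db))


def cooc_vector(postings, term):
    """Counts of documents shared between `term` and every other term."""
    docs = postings.get(term, set())
    return {t: len(docs & d) for t, d in postings.items()
            if t != term and docs & d}


def second_order(postings, a, b):
    """Second-order similarity (assumed measure: cosine of co-occurrence vectors)."""
    va, vb = cooc_vector(postings, a), cooc_vector(postings, b)
    dot = sum(w * vb.get(t, 0) for t, w in va.items())
    na = sqrt(sum(w * w for w in va.values()))
    nb = sqrt(sum(w * w for w in vb.values()))
    return dot / (na * nb) if na and nb else 0.0


def mean_similarity(postings, query, sim):
    """Score each query term by its average similarity to the other query terms."""
    scores = {}
    for t in query:
        others = [u for u in query if u != t]
        total = sum(sim(postings, t, u) for u in others)
        scores[t] = total / len(others) if others else 0.0
    return scores


# Hypothetical toy collection and query, not data from the paper.
docs = [
    "jaguar engine speed car",
    "car engine repair",
    "jaguar car dealer",
    "weather report today",
    "today weather sunny",
]
query = ["jaguar", "car", "engine", "today"]

postings = build_postings(docs)
fo_scores = mean_similarity(postings, query, first_order)
so_scores = mean_similarity(postings, query, second_order)

for term in query:
    print(f"{term:8s} first-order={fo_scores[term]:.3f} "
          f"second-order={so_scores[term]:.3f}")
```

In this toy setting, "today" never co-occurs with the other query terms and shares no co-occurring neighbours with them, so it receives the lowest score under both measures. Whether such a score separates useful from harmful terms on real queries is what the paper's experiments examine; the sketch only shows the mechanics.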