Paper: Using Mutual Information To Resolve Query Translation Ambiguities And Query Term Weighting

ACL ID P99-1029
Title Using Mutual Information To Resolve Query Translation Ambiguities And Query Term Weighting
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 1999
Authors

An easy way of translating queries in one language to the other for cross-language information retrieval (IR) is to use a simple bilingual dictionary. Because of the general- purpose nature of such dictionaries, however, this simple method yields a severe translation ambiguity problem. This paper describes the degree to which this problem arises in Korean-English cross-language IR and suggests a relatively simple yet effective method for disambiguation using mutual information statistics obtained only from the target document collection. In this method, mutual information is used not only to select the best candidate but also to assign a weight to query terms in the target language. Our experimental results based on the TREC-6 collection shows that this method can achieve up to 85% of the ...