Paper: A Data Driven Approach to Query Expansion in Question Answering

ACL ID W08-1805
Title A Data Driven Approach to Query Expansion in Question Answering
Venue Coling 2008: Proceedings of the 2nd workshop on Information Retrieval for Question Answering
Session
Year 2008
Authors

Automated answering of natural language questions is an interesting and useful problem to solve. Question answering (QA) systems often perform information retrieval at an initial stage. Information retrieval (IR) performance, provided by engines such as Lucene, places a bound on overall system performance. For example, no answer-bearing documents are retrieved at low ranks for almost 40% of questions. In this paper, answer texts from previous QA evaluations held as part of the Text REtrieval Conferences (TREC) are paired with queries and analysed in an attempt to identify performance-enhancing words. These words are then used to evaluate the performance of a query expansion method. Data driven extension words were found to help in over 70% of difficult questions. These words can be...