Paper: Corpus-Dependent Association Thesauri For Information Retrieval

ACL ID C00-1059
Title Corpus-Dependent Association Thesauri For Information Retrieval
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2000
Authors

This paper presents a method for automati- cally generating an association thesaurus from a text corpus, and demonstrates its ap- plication to information retrieval. The the- saurus generation method.consists of ex- tracting tenns and co-occurrence data from a corpus and analyzing the correlation between terms statistically. A new method for dis- ambiguating the structure of compound nouns, which is a key component for term extraction, is also proposed. The automati- cally generated thesaurus is effectively used as a tool for exploring infonnation.