Paper: Noun Phrase Analysis In Large Unrestricted Text For Information Retrieval

ACL ID P96-1003
Title Noun Phrase Analysis In Large Unrestricted Text For Information Retrieval
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1996
Authors

Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system. The noun-phrase analysis techniques are also potentia...