Paper: An Endogeneous Corpus-Based Method For Structural Noun Phrase Disambiguation

ACL ID E93-1011
Title An Endogeneous Corpus-Based Method For Structural Noun Phrase Disambiguation
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1993
Authors
  • Didier Bourigault (School for Advanced Studies in the Social Sciences, Paris, France; Electricity of France (EDF) Research Center, France)

In this paper, we describe a method for structural noun phrase disambiguation which relies mainly on the examination of the text corpus under analysis and does not need to integrate any domain-dependent lexico- or syntactico-semantic information. This method is implemented in the Terminology Extraction Software LEXTER. We first explain why the integration of LEXTER in the LEXTER-K project, which aims at building a tool for knowledge extraction from large technical text corpora, requires improving the quality of the terminology extracted by LEXTER. Then we briefly describe the way LEXTER works and show what kind of disambiguation it has to perform when parsing "maximal-length" noun phrases. We introduce a method of disambiguation which relies on a very simple idea: whenever LEXTER has to choo...