Paper: Surface Grammatical Analysis For The Extraction Of Terminological Noun Phrases

ACL ID C92-3150
Title Surface Grammatical Analysis For The Extraction Of Terminological Noun Phrases
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1992
Authors
  • Didier Bourigault (School for Advanced Studies in the Social Sciences, Paris France; Electricity of France (EDF) Research Center, France)

LEXTER is a software package for extracting terminology. A corpus of French language texts on any subject field is fed in, and LEXTER produces a list of likely terminological units to be submitted to an expert to be validated. To identify the terminological units, LEXTER takes their form into account and proceeds in two main stages : analysis, parsing. In the first stage, LEXTER uses a base of rules designed to indentify frontier markers in view to analysing the texts and extracting maximal- length noun phrases. In the second stage, LEXTER parses these maximal-length noun phrases to extract subgroups which by virtue of their grammatical structure and their place in the maximal-length noun phrases are likely to be terminological units. In this article, the type of analysis used (surface gra...