Paper: Term Extraction -I- Term Clustering: An Integrated Platform For Computer-Aided Terminology

ACL ID E99-1003
Title Term Extraction -I- Term Clustering: An Integrated Platform For Computer-Aided Terminology
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1999
Authors

A novel technique for automatic the- saurus construction is proposed. It is based on the complementary use of two tools: (1) a Term Extraction tool that acquires term candidates from tagged corpora through a shallow grammar of noun phrases, and (2) a Term Cluster- ing tool that groups syntactic variants (insertions). Experiments performed on corpora in three technical domains yield clusters of term candidates with preci- sion rates between 93% and 98%. 1 Computational Terminology In the domain of corpus-based terminology two types of tools are currently developed: tools for automatic term extraction (Bourigault, 1993; Justeson and Katz, 1995; Daille, 1996; Brun, 1998) and tools for automatic thesaurus construc- tion (Grefenstette, 1994). These tools are ex- pected to be complementary in th...