Paper: Terminological Variation a Means of Identifying Research Topics from Texts

ACL ID P98-1092
Title Terminological Variation a Means of Identifying Research Topics from Texts
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1998
Authors

After extracting terms from a corpus of titles and abstracts in English, syntactic variation relations are identified amongst them in order to detect research topics. Three types of syntactic variation were studied: permutation, expansion and substitution. These syntactic variations yield other relations of a formal and conceptual nature. Based on a distinction between variation relations according to the grammatical function affected in a term (head or modifier), term variants are first clustered into connected components, which are in turn clustered into classes. These classes relate two or more components through variations involving a change of head word, and thus of topic. The graph obtained reveals the global organisation of research topics in the corpus. A clustering method has been bui...
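The two-level clustering described in the abstract can be illustrated with a minimal sketch. It is not the paper's method: it assumes that each term is a tuple of words whose last word is the head, and it handles only single-word substitution, ignoring permutation and expansion. The names `variation_type` and `cluster_terms` and the sample terms are hypothetical. Modifier variations merge terms into connected components; head variations are kept as links between components, a rough stand-in for the classes.

```python
from itertools import combinations


def variation_type(a, b):
    """Classify a pair of terms as a 'modifier' or 'head' substitution.

    Assumption: a term is a tuple of words and its last word is the head.
    Only same-length terms differing in exactly one word are related.
    """
    if len(a) != len(b) or a == b:
        return None
    diff = [i for i in range(len(a)) if a[i] != b[i]]
    if len(diff) != 1:
        return None
    return "head" if diff[0] == len(a) - 1 else "modifier"


def cluster_terms(terms):
    """Group terms into components, then link components by head change."""
    parent = {t: t for t in terms}

    def find(t):
        # Union-find lookup with path halving.
        while parent[t] != t:
            parent[t] = parent[parent[t]]
            t = parent[t]
        return t

    head_pairs = []
    for a, b in combinations(terms, 2):
        kind = variation_type(a, b)
        if kind == "modifier":
            parent[find(a)] = find(b)   # same head: same component
        elif kind == "head":
            head_pairs.append((a, b))   # head change: possible topic shift

    components = {}
    for t in terms:
        components.setdefault(find(t), set()).add(t)

    # Links between distinct components, identified by their root terms.
    links = {
        frozenset((find(a), find(b)))
        for a, b in head_pairs
        if find(a) != find(b)
    }
    return list(components.values()), links


# Hypothetical sample terms, not taken from the paper's corpus.
terms = [
    ("gene", "expression"),
    ("protein", "expression"),
    ("gene", "regulation"),
    ("soil", "erosion"),
]
components, links = cluster_terms(terms)
```

In this sample, the two "expression" terms fall into one component because only their modifier differs, while "gene regulation" sits in a separate component linked to it through a change of head word.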