Paper: Terminological variation a means of identifying research topics from texts

ACL ID C98-1089
Title Terminological variation a means of identifying research topics from texts
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1998
Authors

After extracting terms from a corpus of titles and abstracts in English, syntactic variation relations are identified amongst them in order to detect research topics. Three types of syntactic variations were studied : permutation, expansion and substitution. These syntactic variations yield other relations of formal and conceptual nature. Basing on a distinction of the variation relations according to the grammatical function affected in a term - head or modifier - term variants are first clustered into connected components which are in turn clustered into classes. These classes relate two or more components through variations involving a change of head word, thus of topic. The graph obtained reveals the global organisation of research topics in the corpus. A clustering me...