Paper: Applying Collocation Segmentation to the ACL Anthology Reference Corpus

ACL ID W12-3207
Title Applying Collocation Segmentation to the ACL Anthology Reference Corpus
Venue ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries
Session
Year 2012
Authors

Collocation is a well-known linguistic phe- nomenon which has a long history of research and use. In this study I employ collocation segmentation to extract terms from the large and complex ACL Anthology Reference Cor- pus, and also briefly research and describe the history of the ACL. The results of the study show that until 1986, the most signifi- cant terms were related to formal/rule based methods. Starting in 1987, terms related to statistical methods became more important. For instance, language model, similarity mea- sure, text classification. In 1990, the terms Penn Treebank, Mutual Information , statis- tical parsing, bilingual corpus, and depen- dency tree became the most important, show- ing that newly released language resources ap- peared together with many new research areas ...