Paper: Inducing Syntactic Categories By Context Distribution Clustering

ACL ID W00-0717
Title Inducing Syntactic Categories By Context Distribution Clustering
Venue International Conference on Computational Natural Language Learning
Session Main Conference
Year 2000
Authors

This paper addresses the issue of the automatic induction of syntactic categories from unanno- tared corpora. Previous techniques give good results, but fail to cope well with ambiguity or rare words. An algorithm, context distribution clustering (CDC), is presented which can be naturally extended to handle these problems.