Paper: Is Knowledge-Free Induction Of Multiword Unit Dictionary Headwords A Solved Problem?

ACL ID W01-0513
Title Is Knowledge-Free Induction Of Multiword Unit Dictionary Headwords A Solved Problem?
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2001
Authors

We seek a knowledge-free method for inducing multiword units from text corpora for use as machine-readable dictionary headwords. We provide two major evaluations of nine existing collocation-finders and illustrate the continuing need for improvement. We use Latent Semantic Analysis to make modest gains in performance, but we show the significant challenges encountered in trying this approach.
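
As a rough illustration of what a knowledge-free collocation-finder does, the sketch below ranks adjacent word pairs by pointwise mutual information (PMI), one widely used association measure. The abstract does not name the nine methods evaluated, so the choice of PMI, the function name pmi_bigrams, the min_count threshold, and the toy corpus are all assumptions made here for illustration, not the authors' method.

```python
import math
from collections import Counter


def pmi_bigrams(tokens, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    Hypothetical helper for illustration only; uses no lexicon, tagger,
    or other outside knowledge, just corpus counts.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        # Rare pairs get unstable PMI estimates, so skip them.
        if count < min_count:
            continue
        p_xy = count / n_bi
        p_x = unigrams[w1] / n_uni
        p_y = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_xy / (p_x * p_y))

    # Highest-scoring pairs are the strongest multiword-unit candidates.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy corpus (assumed data, not from the paper).
tokens = "new york is big . new york is old . the big dog is old".split()
ranked = pmi_bigrams(tokens)
for pair, score in ranked:
    print(pair, round(score, 2))
```

On this toy input the pair ("new", "york") ranks first, but frequent function-word pairs such as ("is", "old") also pass the threshold, which hints at why purely statistical rankings leave room for improvement.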