Paper: Automatic Acquisition Of A Large Subcategorization Dictionary From Corpora

ACL ID P93-1032
Title Automatic Acquisition Of A Large Subcategorization Dictionary From Corpora
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1993
Authors

This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora. It is shown that statistical filtering of the results of a finite state parser running on the output of a stochastic tagger produces high-quality results, despite the error rates of the tagger and the parser. Further, it is argued that this method can be used to learn all subcategorization frames, whereas previous methods are not extensible to a general solution to the problem.
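
The abstract does not say which statistical filter is applied, so the following is only an illustrative sketch of one plausible approach: a binomial hypothesis test that accepts a frame for a verb only if the frame is observed more often than tagger and parser noise alone would explain. The function names, the frame labels, the assumed noise rate `error_rate`, and the threshold `alpha` are all hypothetical and are not taken from the paper.

```python
from math import comb


def binomial_tail(n: int, m: int, p: float) -> float:
    """Return P(X >= m) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(m, n + 1))


def filter_frames(cue_counts, verb_total, error_rate=0.05, alpha=0.02):
    """Keep frames whose counts are unlikely to be pure noise.

    cue_counts: frame label -> number of times the parser reported
                that frame for the verb (hypothetical input format).
    verb_total: total occurrences of the verb in the corpus.
    error_rate: ASSUMED probability that a single occurrence yields a
                spurious cue for a frame the verb does not take.
    alpha:      ASSUMED significance threshold.
    """
    accepted = []
    for frame, m in cue_counts.items():
        # Null hypothesis: the verb does not take this frame, so all m
        # cues are errors occurring independently at rate error_rate.
        if binomial_tail(verb_total, m, error_rate) < alpha:
            accepted.append(frame)
    return sorted(accepted)


# Hypothetical counts for one verb seen 100 times.
counts = {"NP": 40, "NP_PP": 3, "S": 12}
frames = filter_frames(counts, verb_total=100)
print(frames)
```

With these made-up counts, the three `NP_PP` cues fall well within what a 5% noise rate would produce in 100 occurrences, so that frame is rejected, while `NP` and `S` are kept. A filter of this kind is what lets a noisy tagger and parser still yield a reliable dictionary: individual errors are tolerated as long as they do not accumulate beyond the expected noise level.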