Paper: Automatic Acquisition Of Adjectival Subcategorization From Corpora

ACL ID P05-1076
Title Automatic Acquisition Of Adjectival Subcategorization From Corpora
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2005
Authors

This paper describes a novel system for acquiring adjectival subcategorization frames (SCFs) and associated frequency information from English corpus data. The system incorporates a decision-tree classifier for 30 SCF types which tests for the presence of grammatical relations (GRs) in the output of a robust statisti- cal parser. It uses a powerful pattern- matching language to classify GRs into frames hierarchically in a way that mirrors inheritance-based lexica. The experiments show that the system is able to detect SCF types with 70% precision and 66% recall rate. A new tool for linguistic annotation of SCFs in corpus data is also introduced which can considerably alleviate the pro- cess of obtaining training and test data for subcategorization acquisition.