ACL Anthology Network (All About NLP) (beta) The Association Of Computational Linguistics Anthology Network |
ACL ID | W02-2001 |
---|---|
Title | Extracting The Unextractable: A Case Study On Verb-Particles |
Venue | International Conference on Computational Natural Language Learning |
Session | Main Conference |
Year | 2002 |
Authors |
|
This paper proposes a series of techniques for ex- tracting English verbparticle constructions from raw text corpora. We initially propose three basic methods, based on tagger output, chunker output and a chunk grammar, respectively, with the chunk grammar method optionally combining with an at- tachment resolution module to determine the syn- tactic structure of verbpreposition pairs in ambigu- ous constructs. We then combine the three methods together into a single classifler, and add in a number of extra lexical and frequentistic features, producing a flnal F-score of 0.865 over the WSJ.