Paper: Using A Hybrid System Of Corpus- And Knowledge-Based Techniques To Automate The Induction Of A Lexical Sublanguage Grammar

ACL ID C96-2213
Title Using A Hybrid System Of Corpus- And Knowledge-Based Techniques To Automate The Induction Of A Lexical Sublanguage Grammar
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1996
Authors

Porting a Natural Language Processing (NLP) system to a new donmin renmins one of the bottlenecks in syntactic parsing, because of the amount of effort required to fix gaps in the lexicon, and to attune the existing grammar to the idiosyncra- cics of the new sublanguage. This paper shows how thc process of fitting a lexicalizcd grammar to a domain can be automated to a great extent by using a hybrid system that combines traditimml knowledge- based techniques with a corpus-based approach. 1. Porting Bottleneck The trMitional gramnmr knowledgebase is the product of a never-ending attempt by linguists to impose order on something that refuses to be pinned down because it is a living thing. To a great extent, of course, these linguists are able to point to regularities, because language is fir...