Paper: Syntactic Analysis Of Natural Language Using Linguistic Rules And Corpus-Based Patterns

ACL ID C94-1104
Title Syntactic Analysis Of Natural Language Using Linguistic Rules And Corpus-Based Patterns
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1994
Authors

We are concerned with the syntactic annotation of unrestricted text. We combine a rule-based analysis with subsequent exploitation of empiri- cal data. The rule~based surface syntactic anal- yser leaves some amount of ambiguity in the out- put that is resolved using empirical patterns. We have implemented a system for generating and applying corpus-based patterns. Somc patterns describe the main constituents in the sentence and some the local context of the each syntac- tic function. There are several (partly) redml- tant patterns, and the "pattern" parser selects analysis of the sentence ttmt matches the strictest possible pattern(s). The system is applied to an experimeutal corpus. We present the results and discuss possible refinements of the method from a linguistic point of view.