Paper: Guiding A Well-Founded Parser With Corpus Statistics

ACL ID W99-0622
Title Guiding A Well-Founded Parser With Corpus Statistics
Venue 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora
Session Main Conference
Year 1999
Authors

We present a parsing system built from a hand- written lexicon ~ and grammar, and trained on a selection of the Brown Corpus. On the sen- tences it can parse, the parser performs as well as purely corpus-based parsers. Its advantage lies in the fact that its syntactic analyses read- ily support semantic interpretation. Moreover, the system's hand-written foundation allows for a more fully lexicalized probabilistic model, i.e. one sensitive to co-occurrence of lexical heads of phrase constituents.