Paper: Towards History-Based Grammars: Using Richer Models For Probabilistic Parsing

ACL ID P93-1005
Title Towards History-Based Grammars: Using Richer Models For Probabilistic Parsing
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1993
Authors

We describe a generative probabilistic model of natural language, which we call HBG, that takes advantage of detailed linguistic information to resolve ambiguity. HBG incorporates lexical, syntactic, semantic, and structural information from the parse tree into the disambiguation process in a novel way. We use a corpus of bracketed sentences, called a Treebank, in combination with decision tree building to tease out the relevant aspects of a parse tree that will determine the correct parse of a sentence. This stands in contrast to the usual approach of further grammar tailoring via the usual linguistic introspection in the hope of generating the correct parse. In head-to-head tests against one of the best existing robust probabilistic parsing models, which we call P-CFG, the HBG mo...