Paper: An Alternative Method Of Training Probabilistic LR Parsers

ACL ID P04-1070
Title An Alternative Method Of Training Probabilistic LR Parsers
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2004

We discuss existing approaches to train LR parsers, which have been used for statistical resolution of structural ambiguity. These approaches are non- optimal, in the sense that a collection of probability distributions cannot be obtained. In particular, some probability distributions expressible in terms of a context-free grammar cannot be expressed in terms of the LR parser constructed from that grammar, under the restrictions of the existing approaches to training of LR parsers. We present an alternative way of training that is provably optimal, and that al- lows all probability distributions expressible in the context-free grammar to be carried over to the LR parser. We also demonstrate empirically that this kind of training can be effectively applied on a large treebank.