Paper: Using An Annotated Corpus As A Stochastic Grammar

ACL ID E93-1006
Title Using An Annotated Corpus As A Stochastic Grammar
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1993
Authors
  • Rens Bod (University of Amsterdam, Amsterdam, The Netherlands)

In Data Oriented Parsing (DOP), an annotated corpus is used as a stochastic grammar. An input string is parsed by combining subtrees from the corpus. As a consequence, one parse tree can usually be generated by several derivations that involve different subtrees. This leads to a statistical model in which the probability of a parse is equal to the sum of the probabilities of all its derivations.
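
The relation between derivations and parse probability can be illustrated with a minimal Python sketch. The abstract only states that a parse's probability is the sum of its derivations' probabilities. The sketch additionally assumes that a derivation's probability is the product of the probabilities of the subtrees it combines. The subtree identifiers, the probability values, and the function names are hypothetical and chosen purely for illustration.

```python
from math import prod

# Hypothetical subtree probabilities (illustrative values, not from the paper).
subtree_prob = {
    "t1": 0.5,
    "t2": 0.2,
    "t3": 0.1,
    "t4": 0.4,
}

# Two hypothetical derivations that produce the same parse tree,
# each given as the list of corpus subtrees it combines.
derivations_of_parse = [
    ["t1", "t2"],
    ["t3", "t4"],
]

def derivation_probability(derivation, probs):
    """Product of the probabilities of the subtrees used (an assumption of this sketch)."""
    return prod(probs[t] for t in derivation)

def parse_probability(derivations, probs):
    """Sum of the probabilities of all derivations of a single parse tree."""
    return sum(derivation_probability(d, probs) for d in derivations)

p = parse_probability(derivations_of_parse, subtree_prob)
print(f"P(parse) = {p:.2f}")
```

Here the two derivations contribute 0.5 × 0.2 and 0.1 × 0.4 respectively, so the parse receives more probability mass than either derivation alone. This is why the most probable derivation need not correspond to the most probable parse.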