Paper: Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars

ACL ID P12-1071
Title Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2012
Authors

We present a simple and effective framework for exploiting multiple monolingual treebanks with different annotation guidelines for pars- ing. Several types of transformation patterns (TP) are designed to capture the systematic an- notation inconsistencies among different tree- banks. Based on such TPs, we design quasi- synchronous grammar features to augment the baseline parsing models. Our approach can significantly advance the state-of-the-art pars- ing accuracy on two widely used target tree- banks (Penn Chinese Treebank 5.1 and 6.0) using the Chinese Dependency Treebank as the source treebank. The improvements are respectively 1.37% and 1.10% with automatic part-of-speech tags. Moreover, an indirect comparison indicates that our approach also outperforms previous work based on treebank...