Paper: Special Techniques for Constituent Parsing of Morphologically Rich Languages

ACL ID E14-1015
Title Special Techniques for Constituent Parsing of Morphologically Rich Languages
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2014
Authors

We introduce three techniques for improv- ing constituent parsing for morphologi- cally rich languages. We propose a novel approach to automatically find an optimal preterminal set by clustering morphologi- cal feature values and we conduct exper- iments with enhanced lexical models and feature engineering for rerankers. These techniques are specially designed for mor- phologically rich languages (but they are language-agnostic). We report empirical results on the treebanks of five morpho- logically rich languages and show a con- siderable improvement in accuracy and in parsing speed as well.