Paper: Joint Chinese Word Segmentation, POS Tagging and Parsing

ACL ID D12-1046
Title Joint Chinese Word Segmentation, POS Tagging and Parsing
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2012
Authors

In this paper, we propose a novel decoding algorithm for discriminative joint Chinese word segmentation, part-of-speech (POS) tagging, and parsing. Previous work often used a pipeline method (Chinese word segmentation followed by POS tagging and parsing), which suffers from error propagation and cannot use information from later modules to improve earlier components. In our approach, we train the three individual models separately and combine them in a unified framework during decoding. We extend the CYK parsing algorithm so that it can handle word segmentation and POS tagging features. As far as we know, this is the first work on joint Chinese word segmentation, POS tagging, and parsing. Our experimental results on the Chinese Tree Bank 5 corpus sho...
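The idea of running CYK over characters rather than pre-segmented words can be illustrated with a toy sketch. This is not the paper's decoder or feature set: the function names (`joint_cyk`, `extract`), the dictionary-based lexicon and rule tables, and all scores are assumptions made up for illustration. Each character span may either be scored as a single word with a POS tag, or be built from two smaller spans by a binary rule, so segmentation, tagging, and bracketing are chosen together.

```python
from typing import Dict, List, Tuple, Union

# Backpointer: a word string (lexical item) or (split, left_label, right_label).
Back = Union[str, Tuple[int, str, str]]
Chart = Dict[Tuple[int, int], Dict[str, Tuple[float, Back]]]


def joint_cyk(chars: List[str],
              lexicon: Dict[str, Dict[str, float]],
              rules: Dict[Tuple[str, str], Dict[str, float]],
              max_word_len: int = 4) -> Chart:
    """Toy CYK over character spans; scores are additive (log-like)."""
    n = len(chars)
    chart: Chart = {}
    for length in range(1, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            cell: Dict[str, Tuple[float, Back]] = {}
            # Option 1: treat chars[i:j] as one word and assign it a POS tag.
            if length <= max_word_len:
                word = "".join(chars[i:j])
                for tag, score in lexicon.get(word, {}).items():
                    if tag not in cell or score > cell[tag][0]:
                        cell[tag] = (score, word)
            # Option 2: combine two adjacent sub-spans with a binary rule.
            for k in range(i + 1, j):
                for lt, (ls, _) in chart[(i, k)].items():
                    for rt, (rs, _) in chart[(k, j)].items():
                        for parent, s in rules.get((lt, rt), {}).items():
                            total = ls + rs + s
                            if parent not in cell or total > cell[parent][0]:
                                cell[parent] = (total, (k, lt, rt))
            chart[(i, j)] = cell
    return chart


def extract(chart: Chart, i: int, j: int, label: str) -> List[Tuple[str, str]]:
    """Follow backpointers and return the (word, POS) sequence of the best tree."""
    _, back = chart[(i, j)][label]
    if isinstance(back, str):
        return [(back, label)]
    k, lt, rt = back
    return extract(chart, i, k, lt) + extract(chart, k, j, rt)


# Hypothetical toy lexicon and grammar with invented scores.
lexicon = {
    "我": {"PN": -1.0},
    "爱": {"VV": -1.0},
    "北京": {"NR": -1.0},
    "北": {"NN": -3.0},
    "京": {"NN": -3.0},
}
rules = {
    ("VV", "NR"): {"VP": -0.5},
    ("VV", "NP"): {"VP": -0.5},
    ("NN", "NN"): {"NP": -0.5},
    ("PN", "VP"): {"S": -0.5},
}

chars = list("我爱北京")
chart = joint_cyk(chars, lexicon, rules)
best_score = chart[(0, len(chars))]["S"][0]
words = extract(chart, 0, len(chars), "S")
print(best_score, words)
```

In this toy run the two-character word 北京 (tagged NR) competes in the same chart cell with the analysis that splits it into two single-character nouns, and the higher-scoring option is kept. A pipeline would have to commit to one segmentation before the parser could weigh in.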