Paper: A Divide-And-Conquer Strategy For Shallow Parsing Of German Free Texts

ACL ID A00-1033
Title A Divide-And-Conquer Strategy For Shallow Parsing Of German Free Texts
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2000
Authors

We present a divide-and-conquer strategy based on finite state technology for shallow parsing of real- world German texts. In a first phase only the topo- logical structure of a sentence (i.e. , verb groups, subclauses) are determined. In a second phase the phrasal grammars are applied to the contents of the different fields of the main and sub-clauses. Shallow parsing is supported by suitably configured prepro- cessing, including: morphological and on-line com- pound analysis, efficient POS-filtering, and named entity recognition. The whole approach proved to be very useful for processing of free word order lan- guages like German. Especially for the divide-and- conquer parsing strategy we obtained an f-measure of 87.14% on unseen data.