Paper: Broad Coverage Automatic Morphological Segmentation Of German Words

ACL ID C92-4195
Title Broad Coverage Automatic Morphological Segmentation Of German Words
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1992
Authors

A system for the automatic segmentation of German words into morphs was developed. The main linguistic knowledge sources used by the system are a word syntax and a morph dictionary. The syntax is written in the formalism of right linear regular grammars and comprises approximately 1,400 rules de- scribing the set of those sequences of morph classes which underlie syntactically well formed words. The morph dictionary contains almost 11,000 morphs. Each morph is as- signed to up to 6 morph classes. - Statistical evaluations with 6000 test words showed that more than 99% of the segmented words got a correct segmentation.