Paper: Combining Hand-Crafted Rules And Unsupervised Learning In Constraint-Based Morphological Disambiguation

ACL ID W96-0207
Title Combining Hand-Crafted Rules And Unsupervised Learning In Constraint-Based Morphological Disambiguation
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 1996
Authors

In certain respects, our approach has been motivated by Brill's recent work (Brill, 1995b), but with the observation that his transformational approach is not directly applicable to languages like Turkish. Our system combines corpus independent handcrafted constraint rules, constraint rules that are learned via unsupervised learning from a training corpus, and additional statistical information from the corpus to be morphologically disambiguated. The hand-crafted rules are linguistically motivated and tuned to improve precision without sacrificing recall. The unsupervised learning process produces two sets of rules: (i) choose rules which choose morphological parses of a lexical item satisfying constraint effectively discarding other parses, and (ii) delete rules, which delete parses satis...