Paper: Semi-supervised learning of morphological paradigms and lexicons

ACL ID E14-1060
Title Semi-supervised learning of morphological paradigms and lexicons
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2014
Authors

We present a semi-supervised approach to the problem of paradigm induction from inflection tables. Our system ex- tracts generalizations from inflection ta- bles, representing the resulting paradigms in an abstract form. The process is in- tended to be language-independent, and to provide human-readable generalizations of paradigms. The tools we provide can be used by linguists for the rapid cre- ation of lexical resources. We evaluate the system through an inflection table recon- struction task using Wiktionary data for German, Spanish, and Finnish. With no additional corpus information available, the evaluation yields per word form ac- curacy scores on inflecting unseen base forms in different languages ranging from 87.81% (German nouns) to 99.52% (Span- ish verbs); with additional unlab...