Paper: Formal Description Of Multi-Word Lexemes With The Finite-State Formalism IDAREX

ACL ID C96-2182
Title Formal Description Of Multi-Word Lexemes With The Finite-State Formalism IDAREX
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1996
Authors

Most multi-word lexemes (MWLs) allow certain types of variation. This has to be taken into account for their description and their recognition in texts. We sug- gest to describe their syntactic restric- tions and their idiosyncratic peculiarities with local grammar rules, which at the same time allow to express in a general way regularities valid for a whole class of MWLs. The local grammars can be writ- ten in a very convenient and compact way as regular expressions in the formal- ism IDAREX which uses a two-level mor- phology. IDAREX allows to define various types of variables, and to mix canonical and inflected word forms in the regular expressions.