Paper: Automatically Merging Lexicons That Have Incompatible Part-Of-Speech Categories

ACL ID W99-0630
Title Automatically Merging Lexicons That Have Incompatible Part-Of-Speech Categories
Venue 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora
Session Main Conference
Year 1999
Authors

We present a new method to automatically merge lexicons that employ different incom- patible POS categories. Such incompatibil- ities have hindered efforts to combine lexi- cons to maximize coverage with reasonable human effort. Given an "original lexicon", our method is able to merge lexemes from an "additional lexicon" into the original lex- icon, converting lexemes from the additional lexicon with about 89% precision. This level of precision is achieved with the aid of a device we introduce called an anti-lexicon, which neatly summarizes all the essential in- formation we need about the co-occurrence of tags and lemmas. Our model is intuitive, fast, easy to implement, and does not require heavy computational resources nor training corpus. lemma I tag apple INN boy NN calculate VB Exampl...