Paper: Dependency Treebank for Russian: Concept, Tools, Types of Information

ACL ID C00-2143
Title Dependency Treebank for Russian: Concept, Tools, Types of Information
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2000
Authors

The paper describes a tagging scheme designed for the Russian Treebank, and presents tools used for corpus creation.

1. Introductory Remarks

The present paper describes a project aimed at developing the first annotated corpus of Russian texts. Large text corpora have been used in the computational linguistics community long enough: at present, over 20 large corpora for the main European languages are available, the largest of them containing hundreds of millions of words (Language Resources (1997); Marcus, Santorini, and Marcinkiewicz (1993); Kurohashi and Nagao (1998)). So far, however, no annotated corpora for Russian have been developed. To the best of our knowledge, the present project is the first attempt to fill the gap. Different tasks require different annotation levels that e...