ACL Anthology Network (All About NLP) (beta) The Association Of Computational Linguistics Anthology Network |
ACL ID | I08-2111 |
---|---|
Title | Annotation of Multiword Expressions in the Prague Dependency Treebank |
Venue | International Joint Conference on Natural Language Processing |
Session | Main Conference |
Year | 2008 |
Authors |
|
In this article we want to demonstrate that annotation of multiword expressions in the Prague Dependency Treebank is a well-defined task, that it is useful as well as feasible, and that we can achieve good consistency of such annotations in terms of inter-annotator agreement. We show a way to measure agreement for this type of annotation. We also argue that some automatic pre-annotation is possible and that it does not damage the results.

1 Motivation

Various projects involving lexico-semantic annotation have been ongoing for many years. Among them are word sense annotation projects, usually aimed at creating training data for word sense disambiguation. However, the majority of these projects have annotated only a very limited number of word senses (cf. Kilgarriff (1998)). Even am...