Paper: Towards A Modular Data Model For Multi-Layer Annotated Corpora

ACL ID P06-2024
Title Towards A Modular Data Model For Multi-Layer Annotated Corpora
Venue Annual Meeting of the Association of Computational Linguistics
Session Poster Session
Year 2006
Authors
  • Richard Eckart (Darmstadt University of Technology, Darmstadt Germany)

In this paper we discuss the current meth- ods in the representation of corpora anno- tatedatmultiplelevelsoflinguisticorgani- zation (so-called multi-level or multi-layer corpora). Taking five approaches which are representative of the current practice in this area, we discuss the commonalities and differences between them focusing on the underlying data models. The goal of the paper is to identify the common con- cerns in multi-layer corpus representation and processing so as to lay a foundation for a unifying, modular data model.