Paper: Representing Text Chunks

ACL ID E99-1023
Title Representing Text Chunks
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1999
Authors

Dividing sentences in chunks of words is a useful preprocessing step for parsing, information extraction and information retrieval. (l~mshaw and Marcus, 1995) have introduced a "convenient" data rep- resentation for chunking by converting it to a tagging task. In this paper we will examine seven different data repre- sentations for the problem of recogniz- ing noun phrase chunks. We will show that the the data representation choice has a minor influence on chunking per- formance. However, equipped with the most suitable data representation, our memory-based learning chunker was able to improve the best published chunking results for a standard data set.