Paper: Tagging Sentence Boundaries

ACL ID A00-2035
Title Tagging Sentence Boundaries
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2000

In this paper we tackle sentence boundary disambiguation through a part-of-speech (POS) tagging framework. We describe the necessary changes in text tokenization and the implementation of a POS tagger, and we provide results of an evaluation of this system on two corpora. We also describe an extension of traditional POS tagging that combines it with the document-centered approach to proper name identification and abbreviation handling. This made the resulting system robust to domain and topic shifts.
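
As a rough illustration of the idea in the abstract, the sketch below treats each period as its own token and assigns it a tag, using evidence gathered from the document itself to decide ambiguous cases. It is a simplified rule-based stand-in, not the paper's POS tagger: the tag names `SENT_END` and `ABBR_PERIOD`, the functions `tokenize`, `lowercase_vocabulary`, and `tag_periods`, and the small abbreviation list passed in by the caller are all assumptions made for this example.

```python
import re

# Hypothetical tag names, chosen for this illustration only.
SENT_END = "SENT_END"        # period that closes a sentence
ABBR_PERIOD = "ABBR_PERIOD"  # period that only marks an abbreviation


def tokenize(text):
    """Split text into tokens, keeping every period as a separate token."""
    return re.findall(r"[A-Za-z0-9]+|\.|[^\sA-Za-z0-9.]", text)


def lowercase_vocabulary(tokens):
    """Document-level evidence: words that occur in lowercase in this text."""
    return {tok for tok in tokens if tok.islower()}


def tag_periods(tokens, known_abbreviations):
    """Return (token_index, tag) pairs for every period token.

    known_abbreviations is an assumed, caller-supplied set of lowercase
    abbreviations; a real system would need a better source for it.
    """
    seen_lower = lowercase_vocabulary(tokens)
    tags = []
    for i, tok in enumerate(tokens):
        if tok != ".":
            continue
        prev_tok = tokens[i - 1] if i > 0 else ""
        next_tok = tokens[i + 1] if i + 1 < len(tokens) else None
        if prev_tok.lower() not in known_abbreviations:
            # Not after an abbreviation: treat as an ordinary full stop.
            tags.append((i, SENT_END))
        elif next_tok is None:
            # Abbreviation at the very end of the text also ends a sentence.
            tags.append((i, SENT_END))
        elif next_tok[0].isupper() and next_tok.lower() in seen_lower:
            # The next word is capitalized here but lowercase elsewhere in
            # the document, so it is probably a common word that starts a
            # new sentence rather than a proper name.
            tags.append((i, SENT_END))
        else:
            tags.append((i, ABBR_PERIOD))
    return tags
```

For example, in "Dr. Smith met the board." the word "Smith" never occurs in lowercase, so the period after "Dr" is tagged as an abbreviation period, whereas in "We sell pens, paper, etc. The store opens at noon near the park." the word "The" also occurs as "the", so the period after "etc" is tagged as a sentence end. A statistical tagger would weigh such evidence together with the surrounding tags instead of applying fixed rules.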