Paper: A Statistical Model For Domain-Independent Text Segmentation

ACL ID P01-1064
Title A Statistical Model For Domain-Independent Text Segmentation
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2001
Authors

We propose a statistical method that finds the maximum-probability seg- mentation of a given text. This method does not require training data because it estimates probabilities from the given text. Therefore, it can be applied to any text in any domain. An experi- ment showed that the method is more accurate than or at least as accurate as a state-of-the-art text segmentation sys- tem.