Paper: Text Segmentation Based On Similarity Between Words

ACL ID P93-1041
Title Text Segmentation Based On Similarity Between Words
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 1993
  • Hideki Kozima (University of Electro-Communications, Tokyo Japan)

This paper proposes a new indicator of text struc- ture, called the lexical cohesion profile (LCP), which locates segment boundaries in a text. A text segment is a coherent scene; the words in a segment a~e linked together via lexical cohesion relations. LCP records mutual similarity of words in a sequence of text. The similarity of words, which represents their cohesiveness, is computed using a semantic network. Comparison with the text segments marked by a number of subjects shows that LCP closely correlates with the hu- man judgments. LCP may provide valuable in- formation for resolving anaphora and ellipsis.