Paper: You Talking to Me? A Corpus and Algorithm for Conversation Disentanglement

ACL ID P08-1095
Title You Talking to Me? A Corpus and Algorithm for Conversation Disentanglement
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2008
Authors

When multiple conversations occur simultane- ously, a listener must decide which conversa- tion each utterance is part of in order to inter- pret and respond to it appropriately. We refer to this task as disentanglement. We present a corpus of Internet Relay Chat (IRC) dialogue in which the various conversations have been manually disentangled, and evaluate annota- tor reliability. This is, to our knowledge, the first such corpus for internet chat. We pro- pose a graph-theoretic model for disentangle- ment, using discourse-based features which have not been previously applied to this task. The model’s predicted disentanglements are highly correlated with manual annotations. 1 Motivation Simultaneous conversations seem to arise naturally in both informal social interactions and multi-part...