Paper: Movie-DiC: a Movie Dialogue Corpus for Research and Development

ACL ID P12-2040
Title Movie-DiC: a Movie Dialogue Corpus for Research and Development
Venue Annual Meeting of the Association for Computational Linguistics
Session Short Paper
Year 2012
Authors

This paper describes Movie-DiC, a Movie Dialogue Corpus recently collected for research and development purposes. The collected dataset comprises 132,229 dialogues containing a total of 764,146 turns that have been extracted from 753 movies. Details on how the data collection has been created and how it is structured are provided, along with its main statistics and characteristics.