Paper: Towards The Orwellian Nightmare: Separation Of Business And Personal Emails

ACL ID P06-2053
Title Towards The Orwellian Nightmare: Separation Of Business And Personal Emails
Venue Annual Meeting of the Association of Computational Linguistics
Session Poster Session
Year 2006
Authors

This paper describes the largest scale annotation pro- ject involving the Enron email corpus to date. Over 12,500 emails were classified, by humans, into the categories “Business” and “Personal”, and then sub- categorised by type within these categories. The paper quantifies how well humans perform on this task (evaluated by inter-annotator agreement). It presents the problems experienced with the separation of these language types. As a final section, the paper presents preliminary results using a machine to perform this classification task.