Paper: Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality

ACL ID E14-1056
Title Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2014
Authors

Topic models based on latent Dirichlet al- location and related methods are used in a range of user-focused tasks including doc- ument navigation and trend analysis, but evaluation of the intrinsic quality of the topic model and topics remains an open research area. In this work, we explore the two tasks of automatic evaluation of single topics and automatic evaluation of whole topic models, and provide recom- mendations on the best strategy for per- forming the two tasks, in addition to pro- viding an open-source toolkit for topic and topic model evaluation.