Paper: Unsupervised learning of rhetorical structure with un-topic models

ACL ID C14-1002
Title Unsupervised learning of rhetorical structure with un-topic models
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2014
Authors

In this paper we investigate whether unsupervised models can be used to induce conventional aspects of rhetorical language in scientific writing. We rely on the intuition that the rhetorical language used in a document is general in nature and independent of the document?s topic. We describe a Bayesian latent-variable model that implements this intuition. In two empirical evaluations based on the task of argumentative zoning (AZ), we demonstrate that our generality hypothesis is crucial for distinguishing between rhetorical and topical language and that features provided by our unsupervised model trained on a large corpus can improve the performance of a supervised AZ classifier.