Paper: Simple English Wikipedia: A New Text Simplification Task

ACL ID P11-2117
Title Simple English Wikipedia: A New Text Simplification Task
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2011
Authors

In this paper we examine the task of sentence simplification which aims to reduce the reading complexity of a sentence by incorporating more accessible vocabulary and sentence structure. We introduce a new data set that pairs English Wikipedia with Simple English Wikipedia and is orders of magnitude larger than any previously examined for sentence simplification. The data contains the full range of simplification operations including rewording, reordering, insertion and deletion. We provide an analysis of this corpus as well as preliminary results using a phrase-based translation approach for simplification.