Paper: Meaning Unit Segmentation in English and Chinese: a New Approach to Discourse Phenomena

ACL ID W13-3301
Title Meaning Unit Segmentation in English and Chinese: a New Approach to Discourse Phenomena
Venue Discourse in Machine Translation
Session
Year 2013
Authors

2Institute for Infocomm Research, 1 Fusionpolis Way, Singaporejaw97@georgetown.edu {rembanchs,hli}@i2r.a-star.edu.sgAbstract We present a new approach to dialogue processing in terms of ?meaning units?. In our annotation task, we asked speakers of English and Chinese to mark boundaries where they could construct the maximal concept using minimal words. We com- pared English data across genres (news, literature, and policy). We analyzed the agreement for annotators using a state-of- the-art segmentation similarity algorithm and compared annotations with a random baseline. We found that annotators are able to identify meaning units systemati- cally, even though they may disagree on the quantity and position of units. Our analysis includes an examination of phrase structure for annotated unit...