Paper: Maximizing Component Quality in Bilingual Word-Aligned Segmentations

ACL ID E14-1004
Title Maximizing Component Quality in Bilingual Word-Aligned Segmentations
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2014
Authors

Given a pair of source and target language sentences which are translations of each other with known word alignments between them, we extract bilingual phrase-level segmentations of such a pair. This is done by identifying two appropriate measures that assess the quality of phrase segments, one on the monolingual level for both language sides, and one on the bilingual level. The monolingual measure is based on the notion of partition refinements and the bilingual measure is based on structural properties of the graph that represents phrase segments and word alignments. These two measures are incorporated in a basic adaptation of the Cross-Entropy method for the purpose of extracting an N-best list of bilingual phrase-level segmentations. A straightforward application of suc...