Paper: Aligning A Parallel English-Chinese Corpus Statistically With Lexical Criteria

ACL ID P94-1012
Title Aligning A Parallel English-Chinese Corpus Statistically With Lexical Criteria
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 1994
Authors
  • Dekai Wu (University of Science and Technology, Clear Water Bay Hong Kong)

We describe our experience with automatic align- ment of sentences in parallel English-Chinese texts. Our report concerns three related topics: (1) progress on the HKUST English-Chinese Par- allel Bilingual Corpus; (2) experiments addressing the applicability of Gale ~ Church's (1991) length- based statistical method to the task of align- ment involving a non-Indo-European language; and (3) an improved statistical method that also incorporates domain-specific lexical cues.