Paper: Case markers and Morphology: Addressing the crux of the fluency problem in English-Hindi SMT

ACL ID P09-1090
Title Case markers and Morphology: Addressing the crux of the fluency problem in English-Hindi SMT
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2009
Authors

We report in this paper our work on accurately generating case markers and suffixes in English-to-Hindi SMT. Hindi is a relatively free word-order language, and makes use of a comparatively rich set of case markers and morphological suffixes for correct meaning representation. From our experience of large-scale English-Hindi MT, we are convinced that fluency and fidelity in the Hindi output improve by an order of magnitude if accurate case markers and suffixes are produced. The key question is: what entity on the English side encodes the information contained in case markers and suffixes on the Hindi side? Our studies of correspondences in the two languages show that case markers and suffixes in Hindi are predominantly determined by the combination of suffixes and semantic ...