Paper: Improving Statistical Machine Translation In The Medical Domain Using The Unified Medical Language System

ACL ID C04-1114
Title Improving Statistical Machine Translation In The Medical Domain Using The Unified Medical Language System
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2004
Authors

Texts from the medical domain are an important task for natural language processing. This paper investigates the usefulness of a large medical database (the Unified Medical Language System) for the translation of dialogues between doctors and patients using a statistical machine translation system. We are able to show that the extraction of a large dictionary and the usage of semantic type information to generalize the training data significantly improves the translation performance.