Paper: Semi-Supervised Maximum Entropy Based Approach To Acronym And Abbreviation Normalization In Medical Texts

ACL ID P02-1021
Title Semi-Supervised Maximum Entropy Based Approach To Acronym And Abbreviation Normalization In Medical Texts
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2002
Authors

Text normalization is an important aspect of successful information retrieval from medical documents such as clinical notes, radiology reports and discharge summaries. In the medical domain, a significant part of the general problem of text normalization is abbreviation and acronym disambiguation. Numerous abbreviations are used routinely throughout such texts and knowing their meaning is critical to data retrieval from the document. In this paper I will demonstrate a method of automatically generating training data for Maximum Entropy (ME) modeling of abbreviations and acronyms and will show that using ME modeling is a promising technique for abbreviation and acronym normalization. I report on the results of an experiment involving training a number of ME models used to normalize abbrevia...