Paper: Hippocratic Abbreviation Expansion

ACL ID P14-2060
Title Hippocratic Abbreviation Expansion
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2014

Incorrect normalization of text can be par- ticularly damaging for applications like text-to-speech synthesis (TTS) or typing auto-correction, where the resulting nor- malization is directly presented to the user, versus feeding downstream applications. In this paper, we focus on abbreviation expansion for TTS, which requires a ?do no harm?, high precision approach yield- ing few expansion errors at the cost of leaving relatively many abbreviations un- expanded. In the context of a large- scale, real-world TTS scenario, we present methods for training classifiers to establish whether a particular expansion is apt. We achieve a large increase in correct abbrevi- ation expansion when combined with the baseline text normalization component of the TTS system, together with a substan- tial redu...