Paper: Unsupervised Named Entity Transliteration Using Temporal And Phonetic Correlation

ACL ID W06-1630
Title Unsupervised Named Entity Transliteration Using Temporal And Phonetic Correlation
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2006
Authors

In this paper we investigate unsuper- vised name transliteration using compara- ble corpora, corpora where texts in the two languages deal in some of the same top- ics — and therefore share references to named entities — but are not translations of each other. We present two distinct methods for transliteration, one approach using an unsupervised phonetic translit- eration method, and the other using the temporal distribution of candidate pairs. Each of these approaches works quite well, but by combining the approaches one can achieve even better results. We believe that the novelty of our approach lies in the phonetic-based scoring method, which is based on a combination of care- fully crafted phonetic features, and empiri- cal results from the pronunciation errors of second-language ...