Paper: Regularized Interlingual Projections: Evaluation on Multilingual Transliteration

ACL ID D12-1002
Title Regularized Interlingual Projections: Evaluation on Multilingual Transliteration
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2012
Authors

In this paper, we address the problem of build- ing a multilingual transliteration system using an interlingual representation. Our approach uses international phonetic alphabet (IPA) to learn the interlingual representation and thus allows us to use any word and its IPA repre- sentation as a training example. Thus, our ap- proach requires only monolingual resources: a phoneme dictionary that lists words and their IPA representations.1 By adding a phoneme dictionary of a new language, we can readily build a transliteration system into any of the existing previous languages, without the ex- pense of all-pairs data or computation. We also propose a regularization framework for learning the interlingual representation, which accounts for language specific phonemic vari- ability, and thus it c...