Paper: Cross-Lingual Lexical Triggers In Statistical Language Modeling

ACL ID W03-1003
Title Cross-Lingual Lexical Triggers In Statistical Language Modeling
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2003

We propose new methods to take advantage of text in resource-rich languages to sharpen statistical language models in resource-deficient languages. We achieve this through an extension of the method of lexical triggers to the cross-language problem, and by developing a likelihood-based adaptation scheme for combining a trigger model with an N-gram model. We describe the application of such language models for automatic speech recognition. By exploiting a side-corpus of contemporaneous English news articles for adapting a static Chinese language model to transcribe Mandarin news stories, we demonstrate significant reductions in both perplexity and recognition errors. We also compare our cross-lingual adaptation scheme to monolingual language model adaptation, and to an alternate ...