Paper: Interpolated Dirichlet Class Language Model for Speech Recognition Incorporating Long-distance N-grams

ACL ID C14-1169
Title Interpolated Dirichlet Class Language Model for Speech Recognition Incorporating Long-distance N-grams
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2014
Authors

We propose a language modeling (LM) approach that incorporates interpolated distanced n-grams into a Dirichlet class language model (DCLM) (Chien and Chueh, 2011) for speech recognition. The DCLM relaxes both the bag-of-words assumption and the document-level topic extraction of latent Dirichlet allocation (LDA). The latent variable of DCLM reflects the class information of an n-gram event
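
As a rough illustration of what "interpolated distanced n-grams" can mean, the sketch below mixes plain count-based distance bigrams, where the history word sits d positions before the predicted word. It is not the DCLM itself and has no latent class variable. The function names, the add-one smoothing, the toy corpus, and the interpolation weights are all assumptions made for this example.

```python
from collections import Counter

def distance_bigram_counts(tokens, d):
    """Count pairs (w_{i-d}, w_i): the history word sits d positions back."""
    pairs = Counter()
    histories = Counter()
    for i in range(d, len(tokens)):
        pairs[(tokens[i - d], tokens[i])] += 1
        histories[tokens[i - d]] += 1
    return pairs, histories

def distance_bigram_prob(word, history, pairs, histories, vocab_size):
    """Add-one smoothed P_d(word | history); the smoothing is an assumption."""
    return (pairs[(history, word)] + 1) / (histories[history] + vocab_size)

def interpolated_prob(word, context, models, weights, vocab_size):
    """Mix the distance-d estimates: sum_d weights[d-1] * P_d(word | context[-d])."""
    total = 0.0
    for d, ((pairs, histories), lam) in enumerate(zip(models, weights), start=1):
        total += lam * distance_bigram_prob(
            word, context[-d], pairs, histories, vocab_size
        )
    return total

# Toy corpus, purely illustrative.
tokens = "the cat sat on the mat and the cat ran".split()
vocab = sorted(set(tokens))
max_distance = 2
models = [distance_bigram_counts(tokens, d) for d in range(1, max_distance + 1)]
weights = [0.7, 0.3]  # hypothetical interpolation weights; they must sum to 1

# P(sat | context), using "cat" at distance 1 and "the" at distance 2.
p = interpolated_prob("sat", ["the", "cat"], models, weights, len(vocab))
print(p)
```

Because each distance-d component is a normalized distribution and the weights sum to 1, the interpolated estimate is itself a proper distribution over the vocabulary. In practice the weights would be tuned on held-out data rather than fixed by hand.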