Paper: A Phonotactic Language Model For Spoken Language Identification

ACL ID P05-1064
Title A Phonotactic Language Model For Spoken Language Identification
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2005
Authors

We have established a phonotactic language model as the solution to spoken language identification (LID). In this framework, we define a single set of acoustic tokens to represent the acoustic activities in the world’s spoken languages. A voice tokenizer converts a spoken document into a text-like document of acoustic tokens. Thus a spoken document can be represented by a count vector of acoustic tokens and token n-grams in the vector space. We apply latent semantic analysis to the vectors, in the same way that it is applied in information retrieval, in order to capture salient phonotactics present in spoken documents. The vector space modeling of spoken utterances constitutes a paradigm shift in LID technology and has proven to be very successful. It presents a 12.4% error rate re...
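
The pipeline described in the abstract (tokenized document, then a count vector of tokens and token n-grams, then latent semantic analysis) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy token strings, the use of raw unweighted counts, and the choice of a plain truncated SVD from NumPy as the latent semantic analysis step are all assumptions made for illustration.

```python
from collections import Counter
from itertools import chain

import numpy as np


def ngram_counts(tokens, max_n=2):
    """Count token n-grams of order 1..max_n in one tokenized document."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def build_count_matrix(docs, max_n=2):
    """Return (matrix, vocab): one row per document, one column per n-gram."""
    per_doc = [ngram_counts(d, max_n) for d in docs]
    vocab = sorted(set(chain.from_iterable(per_doc)))
    index = {term: j for j, term in enumerate(vocab)}
    matrix = np.zeros((len(docs), len(vocab)))
    for i, counts in enumerate(per_doc):
        for term, c in counts.items():
            matrix[i, index[term]] = c
    return matrix, vocab


def lsa_project(matrix, k):
    """Represent each document by its top-k truncated-SVD coordinates."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k] * s[:k]


def cosine(x, y):
    """Cosine similarity between two vectors."""
    return float(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))


# Hypothetical acoustic-token sequences; a real system would obtain
# these from a voice tokenizer, which is not modeled here.
docs = [
    ["a", "b", "a", "b"],
    ["c", "d", "c", "d"],
    ["a", "b", "b", "a"],
]
matrix, vocab = build_count_matrix(docs, max_n=2)
latent = lsa_project(matrix, k=2)
```

In this toy setup, documents that share tokens and token bigrams end up close together in the reduced space, while a document built from a disjoint token set stays apart. Comparing rows of `latent` with `cosine` is one simple way to inspect that; how the paper's classifier uses the reduced vectors is not specified in this excerpt.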