Paper: Speech Recognition Of Czech - Inclusion Of Rare Words Helps

ACL ID P05-2021
Title Speech Recognition Of Czech - Inclusion Of Rare Words Helps
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2005
Authors

Large vocabulary continuous speech recognition of in ective languages, such as Czech, Russian or Serbo-Croatian, is heavily deteriorated by excessive out of vocabulary rate. In this paper, we tackle the problem of vocabulary selection, lan- guage modeling and pruning for in ective languages. We show that by explicit reduction of out of vocabulary rate we can achieve signi cant improvements in recognition accuracy while almost preserving the model size. Reported results are on Czech speech corpora.