Paper: Phoneme-To-Text Transcription System With An Infinite Vocabulary

ACL ID P06-1092
Title Phoneme-To-Text Transcription System With An Infinite Vocabulary
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2006
Authors

The noisy channel model approach is suc- cessfully applied to various natural lan- guage processing tasks. Currently the main research focus of this approach is adaptation methods, how to capture char- acteristics of words and expressions in a target domain given example sentences in that domain. As a solution we describe a method enlarging the vocabulary of a lan- guage model to an almost infinite size and capturing their context information. Espe- cially the new method is suitable for lan- guages in which words are not delimited by whitespace. We applied our method to a phoneme-to-text transcription task in Japanese and reduced about 10% of the er- rors in the results of an existing method.