Paper: Classification of South African languages using text and acoustic based methods: A case of six selected languages

ACL ID N13-1028
Title Classification of South African languages using text and acoustic based methods: A case of six selected languages
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2013
Authors

Language variations are generally known to have a severe impact on the performance of Human Language Technology Systems. In or- der to predict or improve system performance, a thorough investigation into these variations, similarities and dissimilarities, is required. Distance measures have been used in several applications of speech processing to analyze different varying speech attributes. However, not much work has been done on language dis- tance measures, and even less work has been done involving South African languages. This study explores two methods for measuring the linguistic distance of six South African lan- guages. It concerns a text based method, (the Levenshtein Distance), and an acoustic ap- proach using extracted mean pitch values. The Levenshtein distance us...