Paper: Automatic Prediction of Cognate Orthography Using Support Vector Machines

ACL ID P07-3005
Title Automatic Prediction of Cognate Orthography Using Support Vector Machines
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2007
Authors

This paper describes an algorithm to automatically generate a list of cognates in a target language by means of Support Vector Machines. While Levenshtein distance was used to align the training file, no knowledge repository other than an initial list of cognates used for training purposes was input into the algorithm. Evaluation was set up in a cognate production scenario which mimed a real- life situation where no word lists were available in the target language, delivering the ideal environment to test the feasibility of a more ambitious project that will involve language portability. An overall improvement of 50.58% over the baseline showed promising horizons.