Paper: Bilingual dictionary generation for low-resourced language pairs

ACL ID D09-1090
Title Bilingual dictionary generation for low-resourced language pairs
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2009

Bilingual dictionaries are vital resources in many areas of natural language processing. Numerous methods of machine translation require bilingual dictionaries with large coverage, but less-frequent language pairs rarely have any digitalized resources. Since the need for these resources is increasing, but the human resources are scarce for less represented languages, efficient automatized methods are needed. This paper introduces a fully automated, robust pivot language based bilingual dictionary generation method that uses the WordNet of the pivot language to build a new bilingual dictionary. We propose the usage of WordNet in order to increase accuracy; we also introduce a bidirectional selection method with a flexible threshold to maximize recall. Our evaluations s...
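
To make the general idea of pivot-based generation concrete, the following is a minimal sketch of plain pivot triangulation followed by a simple two-way check. It is not the paper's method: it does not use WordNet, the scoring rule (the fraction of a word's pivot translations that lead to the candidate) and the fixed `threshold` are assumptions chosen for illustration, and the function names and toy entries (`s1`, `t_river`, and so on) are hypothetical placeholders rather than real dictionary data.

```python
from collections import defaultdict


def invert(dictionary):
    """Reverse a word -> set-of-translations mapping."""
    inverted = defaultdict(set)
    for word, translations in dictionary.items():
        for translation in translations:
            inverted[translation].add(word)
    return inverted


def pivot_scores(first_to_pivot, pivot_to_second):
    """Score each candidate pair by the fraction of the first word's
    pivot translations that lead to the second word (an assumed,
    illustrative scoring rule)."""
    scores = defaultdict(dict)
    for word, pivots in first_to_pivot.items():
        counts = defaultdict(int)
        for pivot in pivots:
            for candidate in pivot_to_second.get(pivot, ()):
                counts[candidate] += 1
        for candidate, count in counts.items():
            scores[word][candidate] = count / len(pivots)
    return scores


def bidirectional_select(src_to_pivot, pivot_to_tgt, threshold=0.5):
    """Keep a (source, target) pair only if its score reaches the
    threshold in both the source->target and target->source directions."""
    forward = pivot_scores(src_to_pivot, pivot_to_tgt)
    backward = pivot_scores(invert(pivot_to_tgt), invert(src_to_pivot))
    selected = set()
    for src, candidates in forward.items():
        for tgt, forward_score in candidates.items():
            backward_score = backward.get(tgt, {}).get(src, 0.0)
            if forward_score >= threshold and backward_score >= threshold:
                selected.add((src, tgt))
    return selected


# Hypothetical toy dictionaries: source -> pivot and pivot -> target.
src_to_pivot = {"s1": {"bank", "shore"}, "s2": {"bank"}}
pivot_to_tgt = {"bank": {"t_fin", "t_river"}, "shore": {"t_river"}}

strict = bidirectional_select(src_to_pivot, pivot_to_tgt, threshold=0.75)
loose = bidirectional_select(src_to_pivot, pivot_to_tgt, threshold=0.5)
```

With these toy entries, the stricter threshold keeps only the pairs supported in both directions, while the looser one admits every triangulated candidate; this illustrates the trade-off that a tunable threshold exposes between precision and recall.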