Paper: A Pattern Matching Method For Finding Noun And Proper Noun Translations From Noisy Parallel Corpora

ACL ID P95-1032
Title A Pattern Matching Method For Finding Noun And Proper Noun Translations From Noisy Parallel Corpora
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 1995
Authors

We present a pattern matching method for compiling a bilingual lexicon of nouns and proper nouns from unaligned, noisy paral- lel texts of Asian/Indo-European language pairs. Tagging information of one lan- guage is used. Word frequency and posi- tion information for high and low frequency words are represented in two different vec- tor forms for pattern matching. New an- chor point finding and noise elimination techniques are introduced. We obtained a 73.1% precision. We also show how the results can be used in the compilation of domain-specific noun phrases. 1 Bilingual lexicon compilation without sentence alignment Automatically compiling a bilingual lexicon of nouns and proper nouns can contribute significantly to breaking the bottleneck in machine translation and machine-aided transla...