Paper: Building An MT Dictionary From Parallel Texts Based On Linguistic And Statistical Information

ACL ID C94-1009
Title Building An MT Dictionary From Parallel Texts Based On Linguistic And Statistical Information
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1994
Authors

A method for generating a machine translation (MT) dictionary from parallel texts is described. This method utilizes both statistical information and linguistic information to obtain corresponding words or phrases in parallel texts. By combining these two types of information, translation pairs which cannot be obtained by a linguistic-based method can be extntcted. Over 70% accurate transla- tions of compound nouns and over 50% of unknown words are obtained as tbe first candidate from small Japanese/Englisb parallel texts containing severe dis- tortions.