Paper: Linguistically-Based Sub-Sentential Alignment for Terminology Extraction from a Bilingual Automotive Corpus

ACL ID C08-1067
Title Linguistically-Based Sub-Sentential Alignment for Terminology Extraction from a Bilingual Automotive Corpus
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2008
Authors

We present a sub-sentential alignment system that links linguistically motivated phrases in parallel texts based on lexical correspondences and syntactic similarity. We compare the performance of our sub- sentential alignment system with different symmetrizationheuristicsthatcombinethe GIZA++ alignments of both translation di- rections. We demonstrate that the aligned linguistically motivated phrases are a use- ful means to extract bilingual terminology and more specifically complex multiword terms.