Paper: Extracting Terminologically Relevant Collocations in the Translation of Chinese Monograph

ACL ID I05-1088
Title Extracting Terminologically Relevant Collocations in the Translation of Chinese Monograph
Venue International Joint Conference on Natural Language Processing
Session Main Conference
Year 2005
Authors

This paper suggests a methodology which is aimed to extract the terminologically relevant collocations for translation purposes. Our basic idea is to use a hybrid method which combines the statistical method and linguistic rules. The extraction system used in our work operated at three steps: (1) Tokenization and POS tagging of the corpus; (2) Extraction of multi-word units using statistical measure; (3) Linguistic filtering to make use of syntactic patterns and stop-word list. As a result, hybrid method using linguistic filters proved to be a suitable method for selecting terminological collocations, it has considerably improved the precision of the extraction which is much higher than that of purely statistical method. In our test, hybrid method combining “Log-likelihood rat...