Paper: Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora

ACL ID P08-1089
Title Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2008
Authors

Paraphrase patterns are useful in paraphrase recognition and generation. In this paper, we present a pivot approach for extracting para- phrase patterns from bilingual parallel cor- pora, whereby the English paraphrase patterns are extracted using the sentences in a for- eign language as pivots. We propose a log- linear model to compute the paraphrase likeli- hood of two patterns and exploit feature func- tions based on maximum likelihood estima- tion (MLE) and lexical weighting (LW). Us- ing the presented method, we extract over 1,000,000 pairs of paraphrase patterns from 2M bilingual sentence pairs, the precision of which exceeds 67%. The evaluation re- sults show that: (1) The pivot approach is effective in extracting paraphrase patterns, which significantly outperforms the conven- tion...