Paper: IIIT-H: A Corpus-Driven Co-occurrence Based Probabilistic Model for Noun Compound Paraphrasing

ACL ID S13-2028
Title IIIT-H: A Corpus-Driven Co-occurrence Based Probabilistic Model for Noun Compound Paraphrasing
Venue Joint Conference on Lexical and Computational Semantics
Session
Year 2013
Authors

This paper presents a system for automatically generating a set of plausible paraphrases for a given noun compound and rank them in de- creasing order of their usage represented by the confidence value provided by the human annotators. Our system implements a corpus- driven probabilistic co-occurrence based model for predicting the paraphrases, that uses a seed list of paraphrases extracted from cor- pus to predict other paraphrases based on their co-occurrences. The corpus study reveals that the prepositional paraphrases for the noun compounds are quite frequent and well cov- ered but the verb paraphrases, on the other hand, are scarce, revealing the unsuitability of the model for standalone corpus-driven ap- proach. Therefore, to predict other paraphras- es, we adopt a two-fol...