Paper: The Tradeoffs Between Open and Traditional Relation Extraction

ACL ID P08-1004
Title The Tradeoffs Between Open and Traditional Relation Extraction
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2008
Authors

Traditional Information Extraction (IE) takes a relation name and hand-tagged examples of that relation as input. Open IE is a relation-independent extraction paradigm that is tailored to massive and heterogeneous corpora such as the Web. An Open IE system extracts a diverse set of relational tuples from text without any relation-specific input. How is Open IE possible? We analyze a sample of English sentences to demonstrate that numerous relationships are expressed using a compact set of relation-independent lexico-syntactic patterns, which can be learned by an Open IE system. What are the tradeoffs between Open IE and traditional IE? We consider this question in the context of two tasks. First, when the number of relations is massive, and the relations themselves are not pre-specified, we a...