Paper: Automatic Acquisition Of Domain Knowledge For Information Extraction

ACL ID C00-2136
Title Automatic Acquisition Of Domain Knowledge For Information Extraction
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2000
Authors

In developing an Infbrmation Extraction tIE) system tbr a new class of events or relations, one of the major tasks is identifying the many ways in which these events or relations may be ex- pressed in text. This has generally involved the manual analysis and, in some cases, the anno- tation of large quantities of text involving these events. This paper presents an alternative ap- proach, based on an automatic discovery pro- cedure, ExDIsCO, which identifies a set; of rele- wmt documents and a set of event patterns from un-annotated text, starting from a small set of "seed patterns". We evaluate ExDIScO by com- paring the pertbrmance of discovered patterns against that of manually constructed systems on actual extraction tasks.