Paper: Names And Similarities On The Web: Fact Extraction In The Fast Lane

ACL ID P06-1102
Title Names And Similarities On The Web: Fact Extraction In The Fast Lane
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2006
Authors

In a new approach to large-scale extraction of facts from unstructured text, distributional similarities become an integral part of both the iterative acquisition of high-coverage contextual extraction patterns, and the validation and ranking of candidate facts. The evaluation measures the quality and coverage of facts extracted from one hundred million Web documents, starting from ten seed facts and using no additional knowledge, lexicons or complex tools.
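
The iterative acquisition of extraction patterns from seed facts can be illustrated with a generic bootstrapping loop. The following is a minimal sketch and not the paper's method: it assumes facts are (name, year) pairs, it treats the literal text between the two arguments as the pattern, and it omits the distributional-similarity generalization and ranking that the abstract describes. All function names (`learn_patterns`, `apply_patterns`, `bootstrap`) and the toy sentences are hypothetical.

```python
import re


def learn_patterns(sentences, facts):
    """Collect the infix text found between the two arguments of a known fact."""
    patterns = set()
    for sentence in sentences:
        for name, year in facts:
            match = re.search(re.escape(name) + r"(.+?)" + re.escape(year), sentence)
            if match:
                patterns.add(match.group(1))
    return patterns


def apply_patterns(sentences, patterns):
    """Find (name, year) candidates whose infix matches a learned pattern."""
    candidates = set()
    for sentence in sentences:
        for infix in patterns:
            # Assumption: a name is a run of capitalized words, a year is four digits.
            regex = r"((?:[A-Z][a-z]+ )*[A-Z][a-z]+)" + re.escape(infix) + r"(\d{4})"
            for name, year in re.findall(regex, sentence):
                candidates.add((name, year))
    return candidates


def bootstrap(sentences, seeds, iterations=2):
    """Alternate between learning patterns from facts and facts from patterns."""
    facts = set(seeds)
    for _ in range(iterations):
        patterns = learn_patterns(sentences, facts)
        facts |= apply_patterns(sentences, patterns)
    return facts


if __name__ == "__main__":
    toy_sentences = [
        "Mozart was born in 1756.",
        "Beethoven was born in 1770.",
        "Chopin was born in 1810 in Poland.",
    ]
    print(sorted(bootstrap(toy_sentences, {("Mozart", "1756")})))
```

Starting from a single seed, the loop learns the infix " was born in " and uses it to pick up the other two pairs. Exact-string infixes of this kind are brittle, so a real system needs some way to generalize patterns and to filter noisy candidates.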