Paper: Acquiring Hyponymy Relations From Web Documents

ACL ID N04-1010
Title Acquiring Hyponymy Relations From Web Documents
Venue Human Language Technologies
Session Main Conference
Year 2004

This paper describes an automatic method for acquiring hyponymy relations from HTML documents on the WWW. Hyponymy relations can play a crucial role in various natural language processing systems. Most existing acquisition methods for hyponymy relations rely on particular linguistic patterns, such as “NP such as NP”. Our method, however, does not use such linguistic patterns, and we expect that our procedure can be applied to a wide range of expressions for which existing methods cannot be used. Our acquisition algorithm uses clues such as itemization or listing in HTML documents and statistical measures such as document frequencies and verb-noun co-occurrences.
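To make the two kinds of clues concrete, the following is a minimal sketch, not the paper's actual algorithm. It assumes that itemizations appear as `<ul>`/`<ol>` lists and that document frequency is a plain count of documents containing a term as a substring. The names `ItemizationExtractor`, `extract_item_groups`, and `document_frequency` are hypothetical and introduced only for illustration.

```python
from html.parser import HTMLParser


class ItemizationExtractor(HTMLParser):
    """Collect <li> texts, grouped by their enclosing <ul>/<ol> (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.groups = []     # finished lists, each a list of item strings
        self._stack = []     # one open item list per currently open <ul>/<ol>
        self._buffer = None  # text pieces of the <li> currently being read

    def _flush_item(self):
        # Store the buffered item text, with whitespace normalized, in the open list.
        if self._buffer is not None and self._stack:
            text = " ".join("".join(self._buffer).split())
            if text:
                self._stack[-1].append(text)
        self._buffer = None

    def handle_starttag(self, tag, attrs):
        if tag in ("ul", "ol"):
            self._flush_item()
            self._stack.append([])
        elif tag == "li" and self._stack:
            self._flush_item()  # also handles <li> items with no closing tag
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "li":
            self._flush_item()
        elif tag in ("ul", "ol") and self._stack:
            self._flush_item()
            items = self._stack.pop()
            if items:
                self.groups.append(items)

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)


def extract_item_groups(html):
    """Return one list of item strings per itemization found in the HTML."""
    parser = ItemizationExtractor()
    parser.feed(html)
    parser.close()
    return parser.groups


def document_frequency(term, documents):
    """Count documents containing the term (assumption: case-insensitive substring match)."""
    needle = term.lower()
    return sum(1 for doc in documents if needle in doc.lower())
```

Under these assumptions, each group returned by `extract_item_groups` can be read as a set of hyponym candidates that share an unknown hypernym, and `document_frequency` is the kind of statistic that could help rank candidate hypernyms for that set. How the paper actually combines these clues with verb-noun co-occurrences is not specified in the abstract.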