Paper: HulTech: A General Purpose System for Cross-Level Semantic Similarity based on Anchor Web Counts

ACL ID S14-2050
Title HulTech: A General Purpose System for Cross-Level Semantic Similarity based on Anchor Web Counts
Venue Joint Conference on Lexical and Computational Semantics
Session
Year 2014
Authors

This paper describes the HULTECH team par- ticipation in Task 3 of SemEval-2014. Four different subtasks are provided to the partici- pants, who are asked to determine the semantic similarity of cross-level test pairs: paragraph- to-sentence, sentence-to-phrase, phrase-to- word and word-to-sense. Our system adopts a unified strategy (general purpose system) to calculate similarity across all subtasks based on word Web frequencies. For that purpose, we define ClueWeb InfoSimba, a cross-level similarity corpus-based metric. Results show that our strategy overcomes the proposed base- lines and achieves adequate to moderate re- sults when compared to other systems.