Paper: A Knowledge-based Representation for Cross-Language Document Retrieval and Categorization

ACL ID E14-1044
Title A Knowledge-based Representation for Cross-Language Document Retrieval and Categorization
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 2014
Authors

Current approaches to cross-language document retrieval and categorization are based on discriminative methods which represent documents in a low-dimensional vector space. In this paper we propose a shift from the supervised to the knowledge-based paradigm and provide a document similarity measure which draws on BabelNet, a large multilingual knowledge resource. Our experiments show state-of-the-art results in cross-lingual document retrieval and categorization.