Paper: Exploiting Language Models for Visual Recognition

ACL ID D13-1072
Title Exploiting Language Models for Visual Recognition
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2013

The problem of learning language models from large text corpora has been widely stud- ied within the computational linguistic com- munity. However, little is known about the performance of these language models when applied to the computer vision domain. In this work, we compare representative models: a window-based model, a topic model, a distri- butional memory and a commonsense knowl- edge database, ConceptNet, in two visual recognition scenarios: human action recog- nition and object prediction. We examine whether the knowledge extracted from texts through these models are compatible to the knowledge represented in images. We de- termine the usefulness of different language models in aiding the two visual recognition tasks. The study shows that the language models built from general te...