Paper: Supervised Text-based Geolocation Using Language Models on an Adaptive Grid

ACL ID D12-1137
Title Supervised Text-based Geolocation Using Language Models on an Adaptive Grid
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2012
Authors

The geographical properties of words have re- cently begun to be exploited for geolocating documents based solely on their text, often in the context of social media and online content. One common approach for geolocating texts is rooted in information retrieval. Given training documents labeled with latitude/longitude co- ordinates, a grid is overlaid on the Earth and pseudo-documents constructed by concatenat- ing the documents within a given grid cell; then a location for a test document is chosen based on the most similar pseudo-document. Uniform grids are normally used, but they are sensitive to the dispersion of documents over the earth. We define an alternative grid con- struction using k-d trees that more robustly adapts to data, especially with larger training sets. We also provid...