Which Melbourne? Augmenting geocoding with maps
View / Open Files
Authors
Gritta, M
Pilehvar, MT
Collier, N
Publication Date
2018Journal Title
ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)
Conference Name
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
ISBN
9781948087322
Publisher
Association for Computational Linguistics
Volume
1
Pages
1285-1296
Type
Conference Object
Metadata
Show full item recordCitation
Gritta, M., Pilehvar, M., & Collier, N. (2018). Which Melbourne? Augmenting geocoding with maps. ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 1 1285-1296. https://doi.org/10.18653/v1/p18-1119
Abstract
The purpose of text geolocation is to associate geographic information contained in a document with a set (or sets) of coordinates, either implicitly by using linguistic features and/or explicitly by using geographic metadata combined with heuristics. We introduce a geocoder (location mention disambiguator) that achieves state-of-the-art (SOTA) results on three diverse datasets by exploiting the implicit lexical clues. Moreover, we propose a new method for systematic encoding of geographic metadata to generate two distinct views of the same text. To that end, we introduce the Map Vector (MapVec), a sparse representation obtained by plotting prior geographic probabilities, derived from population figures, on a World Map. We then integrate the implicit (language) and explicit (map) features to significantly improve a range of metrics. We also introduce an open-source dataset for geoparsing of news events covering global disease outbreaks and epidemics to help future evaluation in geoparsing.
Sponsorship
Natural Environment Research Council (1649558)
Engineering and Physical Sciences Research Council (EP/M005089/1)
NERC (via Cranfield University) (NE/M009009/1)
Identifiers
External DOI: https://doi.org/10.18653/v1/p18-1119
This record's URL: https://www.repository.cam.ac.uk/handle/1810/280425
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk