GeoDict: an integrated gazetteer

Nowadays, spatial analysis in text is widely considered as important for both researchers and users. In certain fields such as epidemiology, the extraction of spatial information in text is crucial and both resources and methods are necessary. In most of spatial analysis process, gazetteer is a commonly used resource. A gazetteer is a data source where toponyms (place name) are associated with concepts and their geographic footprint. Unfortunately, most of publicly available gazetteer are incomplete due to their initial purpose. Hence, we propose Geodict, an integrated gazetteer that contains basic yet precise information (multilingual labels, administrative boundaries polygon, etc.) which can be customized. We show its utility when using it for geoparsing (extraction of spatial entities in text). Early evaluation on toponym resolution shows promising results.

Saved in:
Bibliographic Details
Main Authors: Fize, Jacques, Shrivastava, Gaurav
Format: conference_item biblioteca
Language:eng
Published: Association for Computational Linguistics
Subjects:C30 - Documentation et information, B10 - Géographie,
Online Access:http://agritrop.cirad.fr/586514/
http://agritrop.cirad.fr/586514/1/geodict2017.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!