Location reference recognition from texts: A survey and comparison

X Hu, Z Zhou, H Li, Y Hu, F Gu, J Kersten, H Fan… - ACM Computing …, 2023 - dl.acm.org
A vast amount of location information exists in unstructured texts, such as social media
posts, news stories, scientific articles, web pages, travel blogs, and historical archives …

Named entity recognition and classification in historical documents: A survey

M Ehrmann, A Hamdi, EL Pontes, M Romanello… - ACM Computing …, 2023 - dl.acm.org
After decades of massive digitisation, an unprecedented number of historical documents are
available in digital format, along with their machine-readable texts. While this represents a …

Named entity recognition and classification on historical documents: A survey

M Ehrmann, A Hamdi, EL Pontes, M Romanello… - arXiv preprint arXiv …, 2021 - arxiv.org
After decades of massive digitisation, an unprecedented amount of historical documents is
available in digital format, along with their machine-readable texts. While this represents a …

Named entity recognition goes to old regime France: geographic text analysis for early modern French corpora

K McDonough, L Moncla… - International Journal of …, 2019 - Taylor & Francis
Geographic text analysis (GTA) research in the digital humanities has focused on projects
analyzing modern English-language corpora. These projects depend on temporally specific …

[HTML][HTML] AGORA: An intelligent system for the anonymization, information extraction and automatic mapping of sensitive documents

R Juez-Hernandez, L Quijano-Sánchez… - Applied Soft …, 2023 - Elsevier
Public institutions, such as law enforcement agencies or health centers, have a vast volume
of unstructured text documents, eg police reports. Currently, before this data can be shared …

Geospatial knowledge in housing advertisements: Capturing and extracting spatial information from text

L Cadorel, A Blanchi, AGB Tettamanzi - Proceedings of the 11th …, 2021 - dl.acm.org
Information of the geographical and spatial type is found in numerous text documents and
constitutes a very challenging target for extraction. Geoparsing applications have been …

Reconstruction of itineraries from annotated text with an informed spanning tree algorithm

L Moncla, M Gaio, J Nogueras-Iso… - International Journal of …, 2016 - Taylor & Francis
Considerable amounts of geographical data are still collected not in form of GIS data but just
as natural language texts. This paper proposes an approach for the automatic geocoding of …

Geoparsing historical and contemporary literary text set in the City of Edinburgh

B Alex, C Grover, R Tobin, J Oberlander - Language Resources and …, 2019 - Springer
While a reasonable amount of work has gone into automatically geoparsing text at the city or
higher levels of granularity for different types of texts in different domains, there is relatively …

Mlm: a benchmark dataset for multitask learning with multiple languages and modalities

J Armitage, E Kacupaj, G Tahmasebzadeh… - Proceedings of the 29th …, 2020 - dl.acm.org
In this paper, we introduce the MLM (Multiple Languages and Modalities) dataset-a new
resource to train and evaluate multitask systems on samples in multiple modalities and three …

Mapping urban fingerprints of odonyms automatically extracted from French novels

L Moncla, M Gaio, T Joliveau, YF Le Lay… - International Journal …, 2019 - Taylor & Francis
In this paper, we propose and discuss a methodology to map the spatial fingerprints of
novels and authors based on all of the named urban roads (ie, odonyms) extracted from …