Low-resource languages: A review of past work and future challenges

A Magueresse, V Carles, E Heetderks - arXiv preprint arXiv:2006.07264, 2020 - arxiv.org
A current problem in NLP is massaging and processing low-resource languages which lack
useful training attributes such as supervised data, number of native speakers or experts, etc …

Neural entity linking: A survey of models based on deep learning

Ö Sevgili, A Shelmanov, M Arkhipov… - Semantic …, 2022 - content.iospress.com
This survey presents a comprehensive description of recent neural entity linking (EL)
systems developed since 2015 as a result of the “deep learning revolution” in natural …

Entity linking in 100 languages

JA Botha, Z Shan, D Gillick - arXiv preprint arXiv:2011.02690, 2020 - arxiv.org
We propose a new formulation for multilingual entity linking, where language-specific
mentions resolve to a language-agnostic Knowledge Base. We train a dual encoder in this …

Soft gazetteers for low-resource named entity recognition

S Rijhwani, S Zhou, G Neubig, J Carbonell - arXiv preprint arXiv …, 2020 - arxiv.org
Traditional named entity recognition models use gazetteers (lists of entities) as features to
improve performance. Although modern neural network models do not require such hand …

Survey on english entity linking on wikidata: Datasets and approaches

C Möller, J Lehmann, R Usbeck - Semantic Web, 2022 - content.iospress.com
Wikidata is a frequently updated, community-driven, and multilingual knowledge graph.
Hence, Wikidata is an attractive basis for Entity Linking, which is evident by the recent …

Learning from unlabelled data for clinical semantic textual similarity

Y Wang, K Verspoor, T Baldwin - Proceedings of the 3rd Clinical …, 2020 - aclanthology.org
Abstract Domain pretraining followed by task fine-tuning has become the standard paradigm
for NLP tasks, but requires in-domain labelled data for task fine-tuning. To overcome this, we …

Multilingual end to end entity linking

M Plekhanov, N Kassner, K Popat, L Martin… - arXiv preprint arXiv …, 2023 - arxiv.org
Entity Linking is one of the most common Natural Language Processing tasks in practical
applications, but so far efficient end-to-end solutions with multilingual coverage have been …

MELHISSA: a multilingual entity linking architecture for historical press articles

E Linhares Pontes, LA Cabrera-Diego… - International journal on …, 2022 - Springer
Digital libraries have a key role in cultural heritage as they provide access to our culture and
history by indexing books and historical documents (newspapers and letters). Digital …

The Zeno's Paradox ofLow-Resource'Languages

HH Nigatu, AL Tonja, B Rosman, T Solorio… - arXiv preprint arXiv …, 2024 - arxiv.org
The disparity in the languages commonly studied in Natural Language Processing (NLP) is
typically reflected by referring to languages as low vs high-resourced. However, there is …

[PDF][PDF] HPI-DHC@ BioASQ DisTEMIST: Spanish Biomedical Entity Linking with Pre-trained Transformers and Cross-lingual Candidate Retrieval.

F Borchert, MP Schapranow - CLEF (Working Notes), 2022 - dei.unipd.it
Biomedical named entity recognition and entity linking are important building blocks for
various clinical applications and downstream NLP tasks. In the clinical domain, language …