A survey on semantic processing techniques

R Mao, K He, X Zhang, G Chen, J Ni, Z Yang… - Information …, 2024 - Elsevier
Semantic processing is a fundamental research domain in computational linguistics. In the
era of powerful pre-trained language models and large language models, the advancement …

ExtEnD: Extractive entity disambiguation

E Barba, L Procopio, R Navigli - … of the 60th Annual Meeting of the …, 2022 - aclanthology.org
Abstract Local models for Entity Disambiguation (ED) have today become extremely
powerful, in most part thanks to the advent of large pre-trained language models. However …

MultiNERD: A multilingual, multi-genre and fine-grained dataset for named entity recognition (and disambiguation)

S Tedeschi, R Navigli - Findings of the Association for …, 2022 - aclanthology.org
Abstract Named Entity Recognition (NER) is the task of identifying named entities in texts
and classifying them through specific semantic categories, a process which is crucial for a …

[HTML][HTML] Who can verify this? finding authorities for rumor verification in Twitter

F Haouari, T Elsayed, W Mansour - Information Processing & Management, 2023 - Elsevier
A large body of research work has proposed verification techniques for rumors spreading in
social media that mainly relied on subjective evidence, eg, propagation networks or user …

RED: a Filtered and Multilingual Relation Extraction Dataset

PLH Cabot, S Tedeschi, ACN Ngomo… - arXiv preprint arXiv …, 2023 - arxiv.org
Relation Extraction (RE) is a task that identifies relationships between entities in a text,
enabling the acquisition of relational facts and bridging the gap between natural language …

CNER: Concept and Named Entity Recognition

G Martinelli, F Molfese, S Tedeschi… - Proceedings of the …, 2024 - aclanthology.org
Named entities–typically expressed via proper nouns–play a key role in Natural Language
Processing, as their identification and comprehension are crucial in tasks such as Relation …

ID10M: Idiom identification in 10 languages

S Tedeschi, F Martelli, R Navigli - Findings of the Association for …, 2022 - aclanthology.org
Idioms are phrases which present a figurative meaning that cannot be (completely) derived
by looking at the meaning of their individual components. Identifying and understanding …

CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset

S Rücker, A Akbik - arXiv preprint arXiv:2310.16225, 2023 - arxiv.org
The CoNLL-03 corpus is arguably the most well-known and utilized benchmark dataset for
named entity recognition (NER). However, prior works found significant numbers of …

FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research

J Jin, Y Zhu, X Yang, C Zhang, Z Dou - arXiv preprint arXiv:2405.13576, 2024 - arxiv.org
With the advent of Large Language Models (LLMs), the potential of Retrieval Augmented
Generation (RAG) techniques have garnered considerable research attention. Numerous …

Entity disambiguation with entity definitions

L Procopio, S Conia, E Barba, R Navigli - arXiv preprint arXiv:2210.05648, 2022 - arxiv.org
Local models have recently attained astounding performances in Entity Disambiguation
(ED), with generative and extractive formulations being the most promising research …