Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer
In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

A brief overview of universal sentence representation methods: A linguistic view

R Li, X Zhao, MF Moens - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
How to transfer the semantic information in a sentence to a computable numerical
embedding form is a fundamental problem in natural language processing. An informative …

MoverScore: Text generation evaluating with contextualized embeddings and earth mover distance

W Zhao, M Peyrard, F Liu, Y Gao, CM Meyer… - arXiv preprint arXiv …, 2019 - arxiv.org
A robust evaluation metric has a profound impact on the development of text generation
systems. A desirable metric compares system output against references based on their …

Multilingual alignment of contextual word representations

S Cao, N Kitaev, D Klein - arXiv preprint arXiv:2002.03518, 2020 - arxiv.org
We propose procedures for evaluating and strengthening contextual embedding alignment
and show that they are useful in analyzing and improving multilingual BERT. In particular …

Sbert-wk: A sentence embedding method by dissecting bert-based word models

B Wang, CCJ Kuo - IEEE/ACM Transactions on Audio, Speech …, 2020 - ieeexplore.ieee.org
Sentence embedding is an important research topic in natural language processing (NLP)
since it can transfer knowledge to downstream tasks. Meanwhile, a contextualized word …

Beyond mahalanobis distance for textual ood detection

P Colombo, E Dadalto, G Staerman… - Advances in …, 2022 - proceedings.neurips.cc
As the number of AI systems keeps growing, it is fundamental to implement and develop
efficient control mechanisms to ensure the safe and proper functioning of machine learning …

Unsupervised layer-wise score aggregation for textual ood detection

M Darrin, G Staerman, EDC Gomes… - Proceedings of the …, 2024 - ojs.aaai.org
Abstract Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness
and security requirements driven by an increased number of AI-based systems. Existing …

Evaluation of sentence embeddings in downstream and linguistic probing tasks

CS Perone, R Silveira, TS Paula - arXiv preprint arXiv:1806.06259, 2018 - arxiv.org
Despite the fast developmental pace of new sentence embedding methods, it is still
challenging to find comprehensive evaluations of these different techniques. In the past …

Better rewards yield better summaries: Learning to summarise without references

F Böhm, Y Gao, CM Meyer, O Shapira, I Dagan… - arXiv preprint arXiv …, 2019 - arxiv.org
Reinforcement Learning (RL) based document summarisation systems yield state-of-the-art
performance in terms of ROUGE scores, because they directly use ROUGE as the rewards …

Automatic text evaluation through the lens of Wasserstein barycenters

P Colombo, G Staerman, C Clavel… - arXiv preprint arXiv …, 2021 - arxiv.org
A new metric\texttt {BaryScore} to evaluate text generation based on deep contextualized
embeddings eg, BERT, Roberta, ELMo) is introduced. This metric is motivated by a new …