An empirical survey of data augmentation for limited data learning in nlp

J Chen, D Tam, C Raffel, M Bansal… - Transactions of the …, 2023 - direct.mit.edu
NLP has achieved great progress in the past decade through the use of neural models and
large labeled datasets. The dependence on abundant data prevents NLP models from being …

State-of-the-art generalisation research in NLP: a taxonomy and review

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - arXiv preprint arXiv …, 2022 - arxiv.org
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what'good generalisation'entails and how it should be evaluated is …

A survey of automated data augmentation for image classification: Learning to compose, mix, and generate

TH Cheung, DY Yeung - IEEE transactions on neural networks …, 2023 - ieeexplore.ieee.org
Data augmentation is an effective way to improve the generalization of deep learning
models. However, the underlying augmentation methods mainly rely on handcrafted …

Unseen target stance detection with adversarial domain generalization

Z Wang, Q Wang, C Lv, X Cao… - 2020 International Joint …, 2020 - ieeexplore.ieee.org
Although stance detection has made great progress in the past few years, it is still facing the
problem of unseen targets. In this study, we investigate the domain difference between …

A survey of methods for revealing and overcoming weaknesses of data-driven Natural Language Understanding

V Schlegel, G Nenadic… - Natural Language …, 2023 - cambridge.org
Recent years have seen a growing number of publications that analyse Natural Language
Understanding (NLU) datasets for superficial cues, whether they undermine the complexity …

Desiderata for the Context Use of Question Answering Systems

S Shaier, LE Hunter, K von der Wense - arXiv preprint arXiv:2401.18001, 2024 - arxiv.org
Prior work has uncovered a set of common problems in state-of-the-art context-based
question answering (QA) systems: a lack of attention to the context when the latter conflicts …

GradMask: Gradient-guided token masking for textual adversarial example detection

HC Moon, S Joty, X Chi - Proceedings of the 28th ACM SIGKDD …, 2022 - dl.acm.org
We present GradMask, a simple adversarial example detection scheme for natural language
processing (NLP) models. It uses gradient signals to detect adversarially perturbed tokens in …

Toward robust natural language systems

HC Moon - 2023 - dr.ntu.edu.sg
The monumental achievements of deep learning (DL) systems seem to guarantee the
absolute superiority and robustness of modern DL systems, but they have shown significant …

Are multilingual bert models robust? a case study on adversarial attacks for multilingual question answering

S Rosenthal, M Bornea, A Sil - arXiv preprint arXiv:2104.07646, 2021 - arxiv.org
Recent approaches have exploited weaknesses in monolingual question answering (QA)
models by adding adversarial statements to the passage. These attacks caused a reduction …

Contraqa: Question answering under contradicting contexts

L Pan, W Chen, MY Kan, WY Wang - 2021 - openreview.net
With a rise in false, inaccurate, and misleading information in propaganda, news, and social
media, real-world Question Answering (QA) systems face the challenges of synthesizing and …