State-of-the-art generalisation research in NLP: a taxonomy and review

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - arXiv preprint arXiv …, 2022 - arxiv.org
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what 'good generalisation' entails and how it should be evaluated is …

Towards debiasing NLU models from unknown biases

PA Utama, NS Moosavi, I Gurevych - arXiv preprint arXiv:2009.12303, 2020 - arxiv.org
NLU models often exploit biases to achieve high dataset-specific performance without
properly learning the intended task. Recently proposed debiasing methods are shown to be …

A survey on measuring and mitigating reasoning shortcuts in machine reading comprehension

X Ho, JM Meissner, S Sugawara, A Aizawa - arXiv preprint arXiv …, 2022 - arxiv.org
The issue of shortcut learning is widely known in NLP and has been an important research
focus in recent years. Unintended correlations in the data enable models to easily solve …

Generalized but not robust? Comparing the effects of data modification methods on out-of-domain generalization and adversarial robustness

T Gokhale, S Mishra, M Luo, BS Sachdeva… - arXiv preprint arXiv …, 2022 - arxiv.org
Data modification, whether via additional training datasets, data augmentation, debiasing, or
dataset filtering, has been proposed as an effective solution for generalizing to out-of …

Quantifying and attributing the hallucination of large language models via association analysis

L Du, Y Wang, X Xing, Y Ya, X Li, X Jiang… - arXiv preprint arXiv …, 2023 - arxiv.org
Although they demonstrate superb performance on various NLP tasks, large language models
(LLMs) still suffer from the hallucination problem, which threatens their reliability. To …

Which shortcut solution do question answering models prefer to learn?

K Shinoda, S Sugawara, A Aizawa - … of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Question answering (QA) models for reading comprehension tend to exploit spurious
correlations in training sets and thus learn shortcut solutions rather than the solutions …

An empirical study on model-agnostic debiasing strategies for robust natural language inference

T Liu, X Zheng, X Ding, B Chang, Z Sui - arXiv preprint arXiv:2010.03777, 2020 - arxiv.org
Prior work on natural language inference (NLI) debiasing mainly targets one or a few
known biases while not necessarily making the models more robust. In this paper, we focus …

SMoA: Sparse mixture of adapters to mitigate multiple dataset biases

Y Liu, J Yan, Y Chen, J Liu, H Wu - arXiv preprint arXiv:2302.14413, 2023 - arxiv.org
Recent studies reveal that various biases exist in different NLP tasks, and over-reliance on
biases results in models' poor generalization ability and low adversarial robustness. To …

Methods for estimating and improving robustness of language models

M Štefánik - arXiv preprint arXiv:2206.08446, 2022 - arxiv.org
Despite their outstanding performance, large language models (LLMs) suffer from notorious flaws
related to their preference for simple, surface-level textual relations over full semantic …

Coreference reasoning in machine reading comprehension

M Wu, NS Moosavi, D Roth, I Gurevych - arXiv preprint arXiv:2012.15573, 2020 - arxiv.org
Coreference resolution is essential for natural language understanding and has long been
studied in NLP. In recent years, as the format of Question Answering (QA) became a …