- 学术资源搜索

A survey on fairness in large language models

Y Li, M Du, R Song, X Wang, Y Wang - arXiv preprint arXiv:2308.10149, 2023 - arxiv.org

Large language models (LLMs) have shown powerful performance and development
prospect and are widely deployed in the real world. However, LLMs can capture social …

被引用次数：83 相关文章所有 2 个版本

[PDF] acm.org

Fairness in deep learning: A survey on vision and language research

O Parraga, MD More, CM Oliveira, NS Gavenski… - ACM Computing …, 2023 - dl.acm.org

Despite being responsible for state-of-the-art results in several computer vision and natural
language processing tasks, neural networks have faced harsh criticism due to some of their …

被引用次数：37 相关文章

[PDF] neurips.cc

Leace: Perfect linear concept erasure in closed form

N Belrose, D Schneider-Joseph… - Advances in …, 2024 - proceedings.neurips.cc

Abstract Concept erasure aims to remove specified features from a representation. It can
improve fairness (eg preventing a classifier from using gender or race) and interpretability …

被引用次数：98 相关文章所有 5 个版本

[PDF] arxiv.org

Harms of gender exclusivity and challenges in non-binary representation in language technologies

S Dev, M Monajatipoor, A Ovalle… - arXiv preprint arXiv …, 2021 - arxiv.org

Gender is widely discussed in the context of language tasks and when examining the
stereotypes propagated by language models. However, current discussions primarily treat …

被引用次数：169 相关文章所有 6 个版本

[PDF] mdpi.com

A survey on bias in deep NLP

I Garrido-Muñoz, A Montejo-Ráez… - Applied Sciences, 2021 - mdpi.com

Deep neural networks are hegemonic approaches to many machine learning areas,
including natural language processing (NLP). Thanks to the availability of large corpora …

被引用次数：212 相关文章所有 10 个版本

[PDF] mlr.press

Linear adversarial concept erasure

S Ravfogel, M Twiton, Y Goldberg… - … on Machine Learning, 2022 - proceedings.mlr.press

Modern neural models trained on textual data rely on pre-trained representations that
emerge without direct supervision. As these representations are increasingly being used in …

被引用次数：89 相关文章所有 6 个版本

[PDF] arxiv.org

Having beer after prayer? measuring cultural bias in large language models

T Naous, MJ Ryan, A Ritter, W Xu - arXiv preprint arXiv:2305.14456, 2023 - arxiv.org

As the reach of large language models (LMs) expands globally, their ability to cater to
diverse cultural contexts becomes crucial. Despite advancements in multilingual …

被引用次数：92 相关文章所有 4 个版本

[PDF] arxiv.org

On measures of biases and harms in NLP

S Dev, E Sheng, J Zhao, A Amstutz, J Sun… - arXiv preprint arXiv …, 2021 - arxiv.org

Recent studies show that Natural Language Processing (NLP) technologies propagate
societal biases about demographic groups associated with attributes such as gender, race …

被引用次数：84 相关文章所有 4 个版本

[PDF] arxiv.org

Risk taxonomy, mitigation, and assessment benchmarks of large language model systems

T Cui, Y Wang, C Fu, Y Xiao, S Li, X Deng, Y Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs) have strong capabilities in solving diverse natural language
processing tasks. However, the safety and security issues of LLM systems have become the …

被引用次数：50 相关文章所有 2 个版本

[PDF] arxiv.org

MISGENDERED: Limits of large language models in understanding pronouns

T Hossain, S Dev, S Singh - arXiv preprint arXiv:2306.03950, 2023 - arxiv.org

Content Warning: This paper contains examples of misgendering and erasure that could be
offensive and potentially triggering. Gender bias in language technologies has been widely …

被引用次数：28 相关文章所有 7 个版本

高级搜索

QQ 群

A survey on fairness in large language models

Fairness in deep learning: A survey on vision and language research

Leace: Perfect linear concept erasure in closed form

Harms of gender exclusivity and challenges in non-binary representation in language technologies

A survey on bias in deep NLP

Linear adversarial concept erasure

Having beer after prayer? measuring cultural bias in large language models

On measures of biases and harms in NLP

Risk taxonomy, mitigation, and assessment benchmarks of large language model systems

MISGENDERED: Limits of large language models in understanding pronouns

引用