Societal biases in language generation: Progress and challenges

Authors

Emily Sheng, Kai-Wei Chang, Premkumar Natarajan, Nanyun Peng

Publication date

2021/5/10

Source

arXiv preprint arXiv:2105.04054

Description

Technology for language generation has advanced rapidly, spurred by advancements in pre-training large models on massive amounts of data and the need for intelligent agents to communicate in a natural manner. While techniques can effectively generate fluent text, they can also produce undesirable societal biases that can have a disproportionately negative impact on marginalized populations. Language generation presents unique challenges for biases in terms of direct user interaction and the structure of decoding techniques. To better understand these challenges, we present a survey on societal biases in language generation, focusing on how data and techniques contribute to biases and progress towards reducing biases. Motivated by a lack of studies on biases from decoding techniques, we also conduct experiments to quantify the effects of these techniques. By further discussing general trends and open challenges, we call to attention promising directions for research and the importance of fairness and inclusivity considerations for language generation applications.

Total citations

Cited by 161

Citations per year: 2021: 11, 2022: 42, 2023: 68, 2024: 40

Scholar articles

Societal biases in language generation: Progress and challenges

E Sheng, KW Chang, P Natarajan, N Peng - arXiv preprint arXiv:2105.04054, 2021
