Empirical analysis of multi-task learning for reducing model bias in toxic comment detection

W Yin, A Zubiaga - PeerJ Computer Science, 2021 - peerj.com

Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …

被引用次数：166 相关文章所有 12 个版本

[PDF] arxiv.org

Language (technology) is power: A critical survey of" bias" in nlp

SL Blodgett, S Barocas, H Daumé III… - arXiv preprint arXiv …, 2020 - arxiv.org

We survey 146 papers analyzing" bias" in NLP systems, finding that their motivations are
often vague, inconsistent, and lacking in normative reasoning, despite the fact that …

被引用次数：1028 相关文章所有 6 个版本

[PDF] arxiv.org

[图书][B] Challenges in automated debiasing for toxic language detection

X Zhou - 2021 - search.proquest.com

Biased associations have been a challenge in the development of classifiers for detecting
toxic language, hindering both fairness and accuracy. As potential solutions, we investigate …

被引用次数：130 相关文章所有 12 个版本

[PDF] arxiv.org

Demoting racial bias in hate speech detection

M Xia, A Field, Y Tsvetkov - arXiv preprint arXiv:2005.12246, 2020 - arxiv.org

In current hate speech datasets, there exists a high correlation between annotators'
perceptions of toxicity and signals of African American English (AAE). This bias in annotated …

被引用次数：97 相关文章所有 7 个版本

[PDF] arxiv.org

Cyberbullying detection with fairness constraints

O Gencoglu - IEEE Internet Computing, 2020 - ieeexplore.ieee.org

Cyberbullying is a widespread adverse phenomenon among online social interactions in
today's digital society. While numerous computational studies focus on enhancing the …

被引用次数：65 相关文章所有 7 个版本

[PDF] acm.org

Differential tweetment: Mitigating racial dialect bias in harmful tweet detection

A Ball-Burack, MSA Lee, J Cobbe, J Singh - Proceedings of the 2021 …, 2021 - dl.acm.org

Automated systems for detecting harmful social media content are afflicted by a variety of
biases, some of which originate in their training datasets. In particular, some systems have …

被引用次数：42 相关文章所有 3 个版本

[PDF] arxiv.org

Fairness and robustness in invariant learning: A case study in toxicity classification

R Adragna, E Creager, D Madras, R Zemel - arXiv preprint arXiv …, 2020 - arxiv.org

Robustness is of central importance in machine learning and has given rise to the fields of
domain generalization and invariant learning, which are concerned with improving …

被引用次数：45 相关文章所有 2 个版本

[PDF] arxiv.org

A benchmark study of the contemporary toxicity detectors on software engineering interactions

J Sarker, AK Turzo, A Bosu - 2020 27th Asia-Pacific Software …, 2020 - ieeexplore.ieee.org

Automated filtering of toxic conversations may help an Open-source software (OSS)
community to maintain healthy interactions among the project participants. Although, several …

被引用次数：25 相关文章所有 8 个版本

[PDF] umass.edu

Sociolinguistically driven approaches for just natural language processing

SL Blodgett - 2021 - scholarworks.umass.edu

Natural language processing (NLP) systems are now ubiquitous. Yet the benefits of these
language technologies do not accrue evenly to all users, and indeed they can be harmful; …

被引用次数：26 相关文章所有 4 个版本

Analyzing and addressing the difference in toxicity prediction between different comments with same semantic meaning in google's perspective api

SK Gargee, PB Gopinath, SRSR Kancharla… - ICT Systems and …, 2022 - Springer

Social media has become an essential facet of modern society and is quickly becoming
indispensable. The ubiquity of social media, while having its benefits, also opens up people …

被引用次数：8 相关文章所有 2 个版本

高级搜索

QQ 群