Towards generalisable hate speech detection: a review on obstacles and solutions

W Yin, A Zubiaga - PeerJ Computer Science, 2021 - peerj.com
Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …

Language (technology) is power: A critical survey of" bias" in nlp

SL Blodgett, S Barocas, H Daumé III… - arXiv preprint arXiv …, 2020 - arxiv.org
We survey 146 papers analyzing" bias" in NLP systems, finding that their motivations are
often vague, inconsistent, and lacking in normative reasoning, despite the fact that …

[图书][B] Challenges in automated debiasing for toxic language detection

X Zhou - 2021 - search.proquest.com
Biased associations have been a challenge in the development of classifiers for detecting
toxic language, hindering both fairness and accuracy. As potential solutions, we investigate …

Demoting racial bias in hate speech detection

M Xia, A Field, Y Tsvetkov - arXiv preprint arXiv:2005.12246, 2020 - arxiv.org
In current hate speech datasets, there exists a high correlation between annotators'
perceptions of toxicity and signals of African American English (AAE). This bias in annotated …

Cyberbullying detection with fairness constraints

O Gencoglu - IEEE Internet Computing, 2020 - ieeexplore.ieee.org
Cyberbullying is a widespread adverse phenomenon among online social interactions in
today's digital society. While numerous computational studies focus on enhancing the …

Differential tweetment: Mitigating racial dialect bias in harmful tweet detection

A Ball-Burack, MSA Lee, J Cobbe, J Singh - Proceedings of the 2021 …, 2021 - dl.acm.org
Automated systems for detecting harmful social media content are afflicted by a variety of
biases, some of which originate in their training datasets. In particular, some systems have …

Fairness and robustness in invariant learning: A case study in toxicity classification

R Adragna, E Creager, D Madras, R Zemel - arXiv preprint arXiv …, 2020 - arxiv.org
Robustness is of central importance in machine learning and has given rise to the fields of
domain generalization and invariant learning, which are concerned with improving …

A benchmark study of the contemporary toxicity detectors on software engineering interactions

J Sarker, AK Turzo, A Bosu - 2020 27th Asia-Pacific Software …, 2020 - ieeexplore.ieee.org
Automated filtering of toxic conversations may help an Open-source software (OSS)
community to maintain healthy interactions among the project participants. Although, several …

Sociolinguistically driven approaches for just natural language processing

SL Blodgett - 2021 - scholarworks.umass.edu
Natural language processing (NLP) systems are now ubiquitous. Yet the benefits of these
language technologies do not accrue evenly to all users, and indeed they can be harmful; …

Analyzing and addressing the difference in toxicity prediction between different comments with same semantic meaning in google's perspective api

SK Gargee, PB Gopinath, SRSR Kancharla… - ICT Systems and …, 2022 - Springer
Social media has become an essential facet of modern society and is quickly becoming
indispensable. The ubiquity of social media, while having its benefits, also opens up people …