[HTML][HTML] Resources and benchmark corpora for hate speech detection: a systematic review

F Poletto, V Basile, M Sanguinetti, C Bosco… - Language Resources …, 2021 - Springer
Hate Speech in social media is a complex phenomenon, whose detection has recently
gained significant traction in the Natural Language Processing community, as attested by …

[HTML][HTML] A literature review of textual hate speech detection methods and datasets

F Alkomah, X Ma - Information, 2022 - mdpi.com
Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …

Taxonomy of risks posed by language models

L Weidinger, J Uesato, M Rauh, C Griffin… - Proceedings of the …, 2022 - dl.acm.org
Responsible innovation on large-scale Language Models (LMs) requires foresight into and
in-depth understanding of the risks these models may pose. This paper develops a …

Dealing with disagreements: Looking beyond the majority vote in subjective annotations

AM Davani, M Díaz, V Prabhakaran - Transactions of the Association …, 2022 - direct.mit.edu
Majority voting and averaging are common approaches used to resolve annotator
disagreements and derive single ground truth labels from multiple annotations. However …

Challenges in detoxifying language models

J Welbl, A Glaese, J Uesato, S Dathathri… - arXiv preprint arXiv …, 2021 - arxiv.org
Large language models (LM) generate remarkably fluent text and can be efficiently adapted
across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of …

[HTML][HTML] A systematic review of hate speech automatic detection using natural language processing

MS Jahan, M Oussalah - Neurocomputing, 2023 - Elsevier
With the multiplication of social media platforms, which offer anonymity, easy access and
online community formation and online debate, the issue of hate speech detection and …

Latent hatred: A benchmark for understanding implicit hate speech

M ElSherief, C Ziems, D Muchlinski, V Anupindi… - arXiv preprint arXiv …, 2021 - arxiv.org
Hate speech has grown significantly on social media, causing serious consequences for
victims of all demographics. Despite much attention being paid to characterize and detect …

HateCheck: Functional tests for hate speech detection models

P Röttger, B Vidgen, D Nguyen, Z Waseem… - arXiv preprint arXiv …, 2020 - arxiv.org
Detecting online hate is a difficult task that even state-of-the-art models struggle with.
Typically, hate speech detection models are evaluated by measuring their performance on …

[图书][B] Custodians of the Internet: Platforms, content moderation, and the hidden decisions that shape social media

T Gillespie - 2018 - books.google.com
A revealing and gripping investigation into how social media platforms police what we post
online—and the large societal impact of these decisions Most users want their Twitter feed …

A survey on automatic detection of hate speech in text

P Fortuna, S Nunes - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
The scientific study of hate speech, from a computer science point of view, is recent. This
survey organizes and describes the current state of the field, providing a structured overview …