Detecting hate speech on the world wide web

F Poletto, V Basile, M Sanguinetti, C Bosco… - Language Resources …, 2021 - Springer

Hate Speech in social media is a complex phenomenon, whose detection has recently
gained significant traction in the Natural Language Processing community, as attested by …

Cited by 418 - Related articles - All 12 versions

A literature review of textual hate speech detection methods and datasets

F Alkomah, X Ma - Information, 2022 - mdpi.com

Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …

Cited by 102 - Related articles - All 4 versions

Taxonomy of risks posed by language models

L Weidinger, J Uesato, M Rauh, C Griffin… - Proceedings of the …, 2022 - dl.acm.org

Responsible innovation on large-scale Language Models (LMs) requires foresight into and
in-depth understanding of the risks these models may pose. This paper develops a …

Cited by 398 - Related articles - All 7 versions

Dealing with disagreements: Looking beyond the majority vote in subjective annotations

AM Davani, M Díaz, V Prabhakaran - Transactions of the Association …, 2022 - direct.mit.edu

Majority voting and averaging are common approaches used to resolve annotator
disagreements and derive single ground truth labels from multiple annotations. However …

Cited by 255 - Related articles - All 9 versions

Challenges in detoxifying language models

J Welbl, A Glaese, J Uesato, S Dathathri… - arXiv preprint arXiv …, 2021 - arxiv.org

Large language models (LM) generate remarkably fluent text and can be efficiently adapted
across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of …

Cited by 175 - Related articles - All 5 versions

A systematic review of hate speech automatic detection using natural language processing

MS Jahan, M Oussalah - Neurocomputing, 2023 - Elsevier

With the multiplication of social media platforms, which offer anonymity, easy access and
online community formation and online debate, the issue of hate speech detection and …

Cited by 199 - Related articles - All 8 versions

Latent hatred: A benchmark for understanding implicit hate speech

M ElSherief, C Ziems, D Muchlinski, V Anupindi… - arXiv preprint arXiv …, 2021 - arxiv.org

Hate speech has grown significantly on social media, causing serious consequences for
victims of all demographics. Despite much attention being paid to characterize and detect …

Cited by 156 - Related articles - All 8 versions

HateCheck: Functional tests for hate speech detection models

P Röttger, B Vidgen, D Nguyen, Z Waseem… - arXiv preprint arXiv …, 2020 - arxiv.org

Detecting online hate is a difficult task that even state-of-the-art models struggle with.
Typically, hate speech detection models are evaluated by measuring their performance on …

Cited by 210 - Related articles - All 8 versions

[BOOK] Custodians of the Internet: Platforms, content moderation, and the hidden decisions that shape social media

T Gillespie - 2018 - books.google.com

A revealing and gripping investigation into how social media platforms police what we post
online, and the large societal impact of these decisions. Most users want their Twitter feed …

Cited by 2518 - Related articles - All 7 versions

A survey on automatic detection of hate speech in text

P Fortuna, S Nunes - ACM Computing Surveys (CSUR), 2018 - dl.acm.org

The scientific study of hate speech, from a computer science point of view, is recent. This
survey organizes and describes the current state of the field, providing a structured overview …

Cited by 1173 - Related articles - All 4 versions
