NLP for Counterspeech against Hate: A Survey and How-To Guide

H Bonaldi, YL Chung, G Abercrombie… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, counterspeech has emerged as one of the most promising strategies to fight
online hate. These non-escalatory responses tackle online abuse while preserving the …

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

S Hassan, A Sicilia, M Alikhani - arXiv preprint arXiv:2410.11114, 2024 - arxiv.org
Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing
systems. While Large Language Models (LLMs) can generate valuable data for safety …

Counterspeakers' Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate

J Mun, C Buerger, JT Liang, J Garland… - Proceedings of the CHI …, 2024 - dl.acm.org
Counterspeech, i.e., direct responses against hate speech, has become an important tool to
address the increasing amount of hate online while avoiding censorship. Although AI has …

Contextualized graph representations for generating counter-narratives against hate speech

SB Santamaria, H Gómez-Adorno… - Findings of the …, 2024 - aclanthology.org
Hate speech (HS) is a widely acknowledged societal problem with potentially grave effects
on vulnerable individuals and minority groups. Developing counter-narratives (CNs) that …

Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents

S Hassan, HY Chung, XZ Tan, M Alikhani - arXiv preprint arXiv …, 2024 - arxiv.org
When assisting people in daily tasks, robots need to accurately interpret visual cues and
respond effectively in diverse safety-critical situations, such as sharp objects on the floor. In …

Assessing the human likeness of AI-generated counterspeech

X Song, S Mamidisetty, E Blanco, L Hong - arXiv preprint arXiv …, 2024 - arxiv.org
Counterspeech is a targeted response to counteract and challenge abusive or hateful
content. It can effectively curb the spread of hatred and foster constructive online …

Contextualized Counterspeech: Strategies for Adaptation, Personalization, and Evaluation

L Cima, A Miaschi, A Trujillo, M Avvenuti… - arXiv preprint arXiv …, 2024 - arxiv.org
AI-generated counterspeech offers a promising and scalable strategy to curb online toxicity
through direct replies that promote civil discourse. However, current counterspeech is one …

CounterQuill: Investigating the Potential of Human-AI Collaboration in Online Counterspeech Writing

X Ding, K Ping, US Gunturi, B Carik, S Stil… - arXiv preprint arXiv …, 2024 - arxiv.org
Online hate speech has become increasingly prevalent on social media platforms, causing
harm to individuals and society. While efforts have been made to combat this issue through …