Discgen: A framework for discourse-informed counterspeech generation

H Bonaldi, YL Chung, G Abercrombie… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, counterspeech has emerged as one of the most promising strategies to fight
online hate. These non-escalatory responses tackle online abuse while preserving the …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

S Hassan, A Sicilia, M Alikhani - arXiv preprint arXiv:2410.11114, 2024 - arxiv.org

Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing
systems. While Large Language Models (LLMs) can generate valuable data for safety …

被引用次数：1 相关文章所有 3 个版本

[PDF] acm.org

Counterspeakers' Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate

J Mun, C Buerger, JT Liang, J Garland… - Proceedings of the CHI …, 2024 - dl.acm.org

Counterspeech, ie, direct responses against hate speech, has become an important tool to
address the increasing amount of hate online while avoiding censorship. Although AI has …

被引用次数：5 相关文章所有 5 个版本

[PDF] aclanthology.org

Contextualized graph representations for generating counter-narratives against hate speech

SB Santamaria, H Gómez-Adorno… - Findings of the …, 2024 - aclanthology.org

Hate speech (HS) is a widely acknowledged societal problem with potentially grave effects
on vulnerable individuals and minority groups. Developing counter-narratives (CNs) that …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents

S Hassan, HY Chung, XZ Tan, M Alikhani - arXiv preprint arXiv …, 2024 - arxiv.org

When assisting people in daily tasks, robots need to accurately interpret visual cues and
respond effectively in diverse safety-critical situations, such as sharp objects on the floor. In …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Assessing the human likeness of AI-generated counterspeech

X Song, S Mamidisetty, E Blanco, L Hong - arXiv preprint arXiv …, 2024 - arxiv.org

Counterspeech is a targeted response to counteract and challenge abusive or hateful
content. It can effectively curb the spread of hatred and foster constructive online …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Contextualized Counterspeech: Strategies for Adaptation, Personalization, and Evaluation

L Cima, A Miaschi, A Trujillo, M Avvenuti… - arXiv preprint arXiv …, 2024 - arxiv.org

AI-generated counterspeech offers a promising and scalable strategy to curb online toxicity
through direct replies that promote civil discourse. However, current counterspeech is one …

CounterQuill: Investigating the Potential of Human-AI Collaboration in Online Counterspeech Writing

X Ding, K Ping, US Gunturi, B Carik, S Stil… - arXiv preprint arXiv …, 2024 - arxiv.org

Online hate speech has become increasingly prevalent on social media platforms, causing
harm to individuals and society. While efforts have been made to combat this issue through …

高级搜索

QQ 群