[PDF][PDF] Overview of the multilingual text detoxification task at pan 2024

D Dementieva, D Moskovskiy, N Babakov… - Working Notes of …, 2024 - ceur-ws.org
Despite different countries and social platform regulations, digital abusive speech persists
as a significant challenge. One of the way to tackle abusive, or more specifically, toxic …

Through the lens of core competency: Survey on evaluation of large language models

Z Zhuang, Q Chen, L Ma, M Li, Y Han, Y Qian… - arXiv preprint arXiv …, 2023 - arxiv.org
From pre-trained language model (PLM) to large language model (LLM), the field of natural
language processing (NLP) has witnessed steep performance gains and wide practical …

Learning interpretable style embeddings via prompting llms

A Patel, D Rao, A Kothary, K McKeown… - arXiv preprint arXiv …, 2023 - arxiv.org
Style representation learning builds content-independent representations of author style in
text. Stylometry, the analysis of style in text, is often performed by expert forensic linguists …

Subtle misogyny detection and mitigation: An expert-annotated dataset

B Sheppard, A Richter, A Cohen, EA Smith… - arXiv preprint arXiv …, 2023 - arxiv.org
Using novel approaches to dataset development, the Biasly dataset captures the nuance
and subtlety of misogyny in ways that are unique within the literature. Built in collaboration …

CMD: a framework for Context-aware Model self-Detoxification

Z Tang, K Zhou, J Li, Y Ding, P Wang… - Proceedings of the …, 2024 - aclanthology.org
Text detoxification aims to minimize the risk of language models producing toxic content.
Existing detoxification methods of directly constraining the model output or further training …

Multilingual content moderation: A case study on Reddit

M Ye, K Sikka, K Atwell, S Hassan, A Divakaran… - arXiv preprint arXiv …, 2023 - arxiv.org
Content moderation is the process of flagging content based on pre-defined platform rules.
There has been a growing need for AI moderators to safeguard users as well as protect the …

COUNT: COntrastive UNlikelihood text style transfer for text detoxification

MMA Pour, P Farinneya, M Bharadwaj… - Findings of the …, 2023 - aclanthology.org
Offensive and toxic text on social media platforms can lead to polarization and divisiveness
within online communities and hinders constructive dialogue. Text detoxification is a crucial …

Demonstrations are all you need: Advancing offensive content paraphrasing using in-context learning

A Som, K Sikka, H Gent, A Divakaran, A Kathol… - arXiv preprint arXiv …, 2023 - arxiv.org
Paraphrasing of offensive content is a better alternative to content removal and helps
improve civility in a communication environment. Supervised paraphrasers; however, rely …

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

S Hassan, A Sicilia, M Alikhani - arXiv preprint arXiv:2410.11114, 2024 - arxiv.org
Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing
systems. While Large Language Models (LLMs) can generate valuable data for safety …

Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting

A Yerukola, X Zhou, E Clark, M Sap - arXiv preprint arXiv:2305.14755, 2023 - arxiv.org
Most existing stylistic text rewriting methods and evaluation metrics operate on a sentence
level, but ignoring the broader context of the text can lead to preferring generic, ambiguous …