Multilingual content moderation: A case study on Reddit

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao… - arXiv preprint arXiv …, 2024 - arxiv.org

Multilingual Large Language Models are capable of using powerful Large Language
Models to handle and respond to queries in multiple languages, which achieves remarkable …

被引用次数：54 相关文章所有 2 个版本

[PDF] arxiv.org

Demonstrations are all you need: Advancing offensive content paraphrasing using in-context learning

A Som, K Sikka, H Gent, A Divakaran, A Kathol… - arXiv preprint arXiv …, 2023 - arxiv.org

Paraphrasing of offensive content is a better alternative to content removal and helps
improve civility in a communication environment. Supervised paraphrasers; however, rely …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Benchmarking LLM Guardrails in Handling Multilingual Toxicity

Y Yang, S Dan, D Roth, I Lee - arXiv preprint arXiv:2410.22153, 2024 - arxiv.org

With the ubiquity of Large Language Models (LLMs), guardrails have become crucial to
detect and defend against toxic content. However, with the increasing pervasiveness of …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

S Hassan, A Sicilia, M Alikhani - arXiv preprint arXiv:2410.11114, 2024 - arxiv.org

Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing
systems. While Large Language Models (LLMs) can generate valuable data for safety …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents

S Hassan, HY Chung, XZ Tan, M Alikhani - arXiv preprint arXiv …, 2024 - arxiv.org

When assisting people in daily tasks, robots need to accurately interpret visual cues and
respond effectively in diverse safety-critical situations, such as sharp objects on the floor. In …

被引用次数：1 相关文章所有 2 个版本

[PDF] cell.com Full View

A survey of multilingual large language models

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao, M Li… - Patterns, 2025 - cell.com

Multilingual large language models (MLLMs) leverage advanced large language models to
process and respond to queries across multiple languages, achieving significant success in …

[PDF][PDF] Discgen: A framework for discourse-informed counterspeech generation

S Hassan, M Alikhani - Proceedings of the 13th International Joint …, 2023 - aclanthology.org

Counterspeech can be an effective method for battling hateful content on social media.
Automated counterspeech generation can aid in this process. Generated counterspeech …

被引用次数：8 相关文章所有 3 个版本

[PDF] koustuv.com

LLM-Mod: Can Large Language Models Assist Content Moderation?

M Kolla, S Salunkhe, E Chandrasekharan… - Extended Abstracts of the …, 2024 - dl.acm.org

Content moderation is critical for maintaining healthy online spaces. However, it remains a
predominantly manual task. Moderators are often exhausted by low moderator-to-posts ratio …

被引用次数：24 相关文章所有 3 个版本

[PDF] arxiv.org

D-CALM: A dynamic clustering-based active learning approach for mitigating bias

S Hassan, M Alikhani - arXiv preprint arXiv:2305.17013, 2023 - arxiv.org

Despite recent advancements, NLP models continue to be vulnerable to bias. This bias often
originates from the uneven distribution of real-world data and can propagate through the …

被引用次数：9 相关文章所有 4 个版本

[PDF] acm.org

Integrating Content Moderation Systems with Large Language Models

M Franco, O Gaggi, CE Palazzi - ACM Transactions on the Web, 2024 - dl.acm.org

Online Social Networks (OSNs) rely on content moderation systems to ensure platform and
user safety by preventing malicious activities, like the spread of harmful content. However …

高级搜索

QQ 群