Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements

J Deng, J Cheng, H Sun, Z Zhang, M Huang - arXiv preprint arXiv …, 2023 - arxiv.org
As generative large model capabilities advance, safety concerns become more pronounced
in their outputs. To ensure the sustainable growth of the AI ecosystem, it's imperative to …

Improving Zero-Shot Stance Detection by Infusing Knowledge from Large Language Models

M Guo, X Jiang, Y Liao - International Conference on Intelligent …, 2024 - Springer
Abstract The Zero-shot Stance Detection (ZSSD) task is designed to predict someone's
attitude towards unseen targets with limited training data. However, existing methods often …

Chinese Offensive Language Detection Algorithm based on Pre-trained Language model and Pointer Network Augmentation

B Hou, X Xie, D Zhang, L Zheng… - 2024 5th International …, 2024 - ieeexplore.ieee.org
The traditional offensive language detection methods suffer from issues such as inadequate
understanding of semantic information and sensitivity to noise in text. To address these …