J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang… - arXiv e …, 2023 - ui.adsabs.harvard.edu
With the development of large language models (LLMs), striking a balance between the
performance and safety of AI systems has never been more critical. However, the inherent …