J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv e …, 2023 - ui.adsabs.harvard.edu
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …