A Turner, L Thiergart, D Udell,
G Leech, U Mini… - arXiv preprint arXiv …, 2023 - arxiv.org
Reliably controlling the behavior of large language models (LLMs) is a pressing open
problem. Existing methods include supervised finetuning, reinforcement learning from …