MA Bakker, MJ Chadwick, HR Sheahan… - Proceedings of the 36th …, 2022 - dl.acm.org
Recent work on large language models (LLMs) has used fine-tuning to align outputs with
the preferences of a prototypical user. This work assumes that human preferences are static …