Entangled preferences: The history and risks of reinforcement learning and human feedback

N Lambert, TK Gilbert, T Zick - arXiv preprint arXiv:2310.13595, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) has emerged as a powerful technique
to make large language models (LLMs) easier to use and more effective. A core piece of the …

[PDF][PDF] Investigating the Effects of External Communication and Platoon Behavior on Manual Drivers at Highway Access

M COLLEY, O RAJABI, E RUKZIO - 2024 - uni-ulm.de
1 INTRODUCTION Automated vehicles (AVs) will change traffic both for vulnerable road
users (VRUs) such as pedestrians [17, 30, 37] as well as for manual drivers [11, 57, 58] …