J Park, Y Seo, J Shin, H Lee, P Abbeel… - … Conference on Learning …, 2022 - pure.kaist.ac.kr
Preference-based reinforcement learning (RL) has shown potential for teaching agents to
perform the target tasks without a costly, pre-defined reward function by learning the reward …