R Liu, F Bai, Y Du, Y Yang - … of the 36th International Conference on …, 2022 - dl.acm.org
Setting up a well-designed reward function has been challenging for many reinforcement
learning applications. Preference-based reinforcement learning (PbRL) provides a new …