Dara: Dynamics-aware reward augmentation in offline reinforcement learning

J Liu, H Zhang, D Wang - arXiv preprint arXiv:2203.06662, 2022 - arxiv.org
Offline reinforcement learning algorithms promise to be applicable in settings where a fixed
dataset is available and no new experience can be acquired. However, such formulation is …

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

R Wei, N Lambert, A McDonald, A Garcia… - arXiv preprint arXiv …, 2023 - arxiv.org
Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient,
adaptive, and explainable by learning an explicit model of the environment. While the …