EvIL: Evolution Strategies for Generalisable Imitation Learning

S Sapora, G Swamy, C Lu, YW Teh… - arXiv preprint arXiv …, 2024 - arxiv.org
Often in imitation learning (IL), the environment we collect expert demonstrations in
and the environment we want to deploy our learned policy in aren't exactly the same (e.g. …

Bootstrapped Reward Shaping

J Adamczyk, V Makarenko, S Tiomkin… - arXiv preprint arXiv …, 2025 - arxiv.org
In reinforcement learning, especially in sparse-reward domains, many environment steps
are required to observe reward information. In order to increase the frequency of such …