Robust model-free reinforcement learning with multi-objective Bayesian optimization

M Turchetta, A Krause, S Trimpe - 2020 IEEE international …, 2020 - ieeexplore.ieee.org
In reinforcement learning (RL), an autonomous agent learns to perform complex tasks by
maximizing an exogenous reward signal while interacting with its environment. In real world …

Risk-averse offline reinforcement learning

NA Urpí, S Curi, A Krause - arXiv preprint arXiv:2102.05371, 2021 - arxiv.org
Training Reinforcement Learning (RL) agents in high-stakes applications might be too
prohibitive due to the risk associated to exploration. Thus, the agent can only use data …

Barc: Backward reachability curriculum for robotic reinforcement learning

B Ivanovic, J Harrison, A Sharma… - … on Robotics and …, 2019 - ieeexplore.ieee.org
Model-free Reinforcement Learning (RL) offers an attractive approach to learn control
policies for high dimensional systems, but its relatively poor sample complexity often …

Towards tractable optimism in model-based reinforcement learning

A Pacchiano, P Ball, J Parker-Holder… - Uncertainty in …, 2021 - proceedings.mlr.press
The principle of optimism in the face of uncertainty is prevalent throughout sequential
decision making problems such as multi-armed bandits and reinforcement learning (RL). To …

Robust reinforcement learning via genetic curriculum

Y Song, J Schneider - 2022 International Conference on …, 2022 - ieeexplore.ieee.org
Achieving robust performance is crucial when applying deep reinforcement learning (RL) in
safety critical systems. Some of the state of the art approaches try to address the problem …

Lyapunov design for robust and efficient robotic reinforcement learning

T Westenbroek, F Castaneda, A Agrawal… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent advances in the reinforcement learning (RL) literature have enabled roboticists to
automatically train complex policies in simulated environments. However, due to the poor …

Observational overfitting in reinforcement learning

X Song, Y Jiang, S Tu, Y Du, B Neyshabur - arXiv preprint arXiv …, 2019 - arxiv.org
A major component of overfitting in model-free reinforcement learning (RL) involves the case
where the agent may mistakenly correlate reward with certain spurious features from the …

Guided uncertainty-aware policy optimization: Combining learning and model-based strategies for sample-efficient policy learning

MA Lee, C Florensa, J Tremblay… - … on robotics and …, 2020 - ieeexplore.ieee.org
Traditional robotic approaches rely on an accurate model of the environment, a detailed
description of how to perform the task, and a robust perception system to keep track of the …

Evolving rewards to automate reinforcement learning

A Faust, A Francis, D Mehta - arXiv preprint arXiv:1905.07628, 2019 - arxiv.org
Many continuous control tasks have easily formulated objectives, yet using them directly as
a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many …

Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees

Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma - arXiv preprint arXiv:1807.03858, 2018 - arxiv.org
Model-based reinforcement learning (RL) is considered to be a promising approach to
reduce the sample complexity that hinders model-free RL. However, the theoretical …