相关文章- 学术资源搜索

Robust model-free reinforcement learning with multi-objective Bayesian optimization

M Turchetta, A Krause, S Trimpe - 2020 IEEE international …, 2020 - ieeexplore.ieee.org

In reinforcement learning (RL), an autonomous agent learns to perform complex tasks by
maximizing an exogenous reward signal while interacting with its environment. In real world …

被引用次数：42 相关文章所有 5 个版本

[PDF] arxiv.org

Risk-averse offline reinforcement learning

NA Urpí, S Curi, A Krause - arXiv preprint arXiv:2102.05371, 2021 - arxiv.org

Training Reinforcement Learning (RL) agents in high-stakes applications might be too
prohibitive due to the risk associated to exploration. Thus, the agent can only use data …

被引用次数：81 相关文章所有 2 个版本

[PDF] arxiv.org

Barc: Backward reachability curriculum for robotic reinforcement learning

B Ivanovic, J Harrison, A Sharma… - … on Robotics and …, 2019 - ieeexplore.ieee.org

Model-free Reinforcement Learning (RL) offers an attractive approach to learn control
policies for high dimensional systems, but its relatively poor sample complexity often …

被引用次数：69 相关文章所有 5 个版本

[PDF] mlr.press

Towards tractable optimism in model-based reinforcement learning

A Pacchiano, P Ball, J Parker-Holder… - Uncertainty in …, 2021 - proceedings.mlr.press

The principle of optimism in the face of uncertainty is prevalent throughout sequential
decision making problems such as multi-armed bandits and reinforcement learning (RL). To …

被引用次数：14 相关文章所有 8 个版本

[PDF] arxiv.org

Robust reinforcement learning via genetic curriculum

Y Song, J Schneider - 2022 International Conference on …, 2022 - ieeexplore.ieee.org

Achieving robust performance is crucial when applying deep reinforcement learning (RL) in
safety critical systems. Some of the state of the art approaches try to address the problem …

被引用次数：8 相关文章所有 4 个版本

[PDF] arxiv.org

Lyapunov design for robust and efficient robotic reinforcement learning

T Westenbroek, F Castaneda, A Agrawal… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent advances in the reinforcement learning (RL) literature have enabled roboticists to
automatically train complex policies in simulated environments. However, due to the poor …

被引用次数：16 相关文章所有 5 个版本

[PDF] arxiv.org

Observational overfitting in reinforcement learning

X Song, Y Jiang, S Tu, Y Du, B Neyshabur - arXiv preprint arXiv …, 2019 - arxiv.org

A major component of overfitting in model-free reinforcement learning (RL) involves the case
where the agent may mistakenly correlate reward with certain spurious features from the …

被引用次数：140 相关文章所有 6 个版本

[PDF] arxiv.org

Guided uncertainty-aware policy optimization: Combining learning and model-based strategies for sample-efficient policy learning

MA Lee, C Florensa, J Tremblay… - … on robotics and …, 2020 - ieeexplore.ieee.org

Traditional robotic approaches rely on an accurate model of the environment, a detailed
description of how to perform the task, and a robust perception system to keep track of the …

被引用次数：66 相关文章所有 3 个版本

[PDF] arxiv.org

Evolving rewards to automate reinforcement learning

A Faust, A Francis, D Mehta - arXiv preprint arXiv:1905.07628, 2019 - arxiv.org

Many continuous control tasks have easily formulated objectives, yet using them directly as
a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many …

被引用次数：60 相关文章所有 5 个版本

[PDF] arxiv.org

Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees

Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma - arXiv preprint arXiv:1807.03858, 2018 - arxiv.org

Model-based reinforcement learning (RL) is considered to be a promising approach to
reduce the sample complexity that hinders model-free RL. However, the theoretical …

被引用次数：232 相关文章所有 4 个版本

高级搜索

QQ 群

Robust model-free reinforcement learning with multi-objective Bayesian optimization

Risk-averse offline reinforcement learning

Barc: Backward reachability curriculum for robotic reinforcement learning

Towards tractable optimism in model-based reinforcement learning

Robust reinforcement learning via genetic curriculum

Lyapunov design for robust and efficient robotic reinforcement learning

Observational overfitting in reinforcement learning

Guided uncertainty-aware policy optimization: Combining learning and model-based strategies for sample-efficient policy learning

Evolving rewards to automate reinforcement learning

Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees

引用