Cautious adaptation for reinforcement learning in safety-critical settings

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org

The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

被引用次数：574 相关文章所有 9 个版本

[PDF] arxiv.org

A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer

Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

被引用次数：79 相关文章所有 4 个版本

[PDF] neurips.cc

Learning to synthesize programs as interpretable and generalizable policies

D Trivedi, J Zhang, SH Sun… - Advances in neural …, 2021 - proceedings.neurips.cc

Recently, deep reinforcement learning (DRL) methods have achieved impressive
performance on tasks in a variety of domains. However, neural network policies produced …

被引用次数：59 相关文章所有 7 个版本

A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning

EF Morales, R Murrieta-Cid, I Becerra… - Intelligent Service …, 2021 - Springer

This article is about deep learning (DL) and deep reinforcement learning (DRL) works
applied to robotics. Both tools have been shown to be successful in delivering data-driven …

被引用次数：60 相关文章所有 4 个版本

[PDF] arxiv.org

How to certify machine learning based safety-critical systems? A systematic literature review

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer

Abstract Context Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

被引用次数：69 相关文章所有 7 个版本

[PDF] arxiv.org

Probabilistic constraint for safety-critical reinforcement learning

W Chen, D Subramanian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In this paper, we consider the problem of learning safe policies for probabilistic-constrained
reinforcement learning (RL). Specifically, a safe policy or controller is one that, with high …

被引用次数：11 相关文章所有 4 个版本

A collective AI via lifelong learning and sharing at the edge

A Soltoggio, E Ben-Iwhiwhu, V Braverman… - Nature Machine …, 2024 - nature.com

One vision of a future artificial intelligence (AI) is where many separate units can learn
independently over a lifetime and share their knowledge with each other. The synergy …

被引用次数：4 相关文章所有 4 个版本

[PDF] neurips.cc

A simple yet effective strategy to robustify the meta learning paradigm

Q Wang, Y Lv, Z Xie, J Huang - Advances in Neural …, 2024 - proceedings.neurips.cc

Meta learning is a promising paradigm to enable skill transfer across tasks. Most previous
methods employ the empirical risk minimization principle in optimization. However, the …

被引用次数：3 相关文章所有 5 个版本

[PDF] mlr.press

Safe driving via expert guided policy optimization

Z Peng, Q Li, C Liu, B Zhou - Conference on Robot Learning, 2022 - proceedings.mlr.press

When learning common skills like driving, beginners usually have domain experts standing
by to ensure the safety of the learning process. We formulate such learning scheme under …

被引用次数：36 相关文章所有 5 个版本

[PDF] mlr.press

Accelerating safe reinforcement learning with constraint-mismatched baseline policies

TY Yang, J Rosca, K Narasimhan… - … on Machine Learning, 2021 - proceedings.mlr.press

We consider the problem of reinforcement learning when provided with (1) a baseline
control policy and (2) a set of constraints that the learner must satisfy. The baseline policy …

被引用次数：25 相关文章所有 4 个版本

高级搜索

QQ 群