Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential
decision-making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Learning to synthesize programs as interpretable and generalizable policies

D Trivedi, J Zhang, SH Sun… - Advances in neural …, 2021 - proceedings.neurips.cc
Recently, deep reinforcement learning (DRL) methods have achieved impressive
performance on tasks in a variety of domains. However, neural network policies produced …

A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning

EF Morales, R Murrieta-Cid, I Becerra… - Intelligent Service …, 2021 - Springer
This article is about deep learning (DL) and deep reinforcement learning (DRL) works
applied to robotics. Both tools have been shown to be successful in delivering data-driven …

How to certify machine learning based safety-critical systems? A systematic literature review

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer
Context: Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

Probabilistic constraint for safety-critical reinforcement learning

W Chen, D Subramanian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this paper, we consider the problem of learning safe policies for probabilistic-constrained
reinforcement learning (RL). Specifically, a safe policy or controller is one that, with high …

A collective AI via lifelong learning and sharing at the edge

A Soltoggio, E Ben-Iwhiwhu, V Braverman… - Nature Machine …, 2024 - nature.com
One vision of a future artificial intelligence (AI) is where many separate units can learn
independently over a lifetime and share their knowledge with each other. The synergy …

A simple yet effective strategy to robustify the meta learning paradigm

Q Wang, Y Lv, Z Xie, J Huang - Advances in Neural …, 2024 - proceedings.neurips.cc
Meta learning is a promising paradigm to enable skill transfer across tasks. Most previous
methods employ the empirical risk minimization principle in optimization. However, the …

Safe driving via expert guided policy optimization

Z Peng, Q Li, C Liu, B Zhou - Conference on Robot Learning, 2022 - proceedings.mlr.press
When learning common skills like driving, beginners usually have domain experts standing
by to ensure the safety of the learning process. We formulate such learning scheme under …

Accelerating safe reinforcement learning with constraint-mismatched baseline policies

TY Yang, J Rosca, K Narasimhan… - … on Machine Learning, 2021 - proceedings.mlr.press
We consider the problem of reinforcement learning when provided with (1) a baseline
control policy and (2) a set of constraints that the learner must satisfy. The baseline policy …