A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

Do the rewards justify the means? measuring trade-offs between rewards and ethical behavior in the machiavelli benchmark

A Pan, JS Chan, A Zou, N Li, S Basart… - International …, 2023 - proceedings.mlr.press
Artificial agents have traditionally been trained to maximize reward, which may incentivize
power-seeking and deception, analogous to how next-token prediction in language models …

A review of reinforcement learning based energy management systems for electrified powertrains: Progress, challenge, and potential solution

AH Ganesh, B Xu - Renewable and Sustainable Energy Reviews, 2022 - Elsevier
The impact of internal combustion engine-powered automobiles on climate change due to
emissions and the depletion of fossil fuels has contributed to the progress of electrified …

A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G Jin, Y Dong… - Artificial Intelligence …, 2024 - Springer
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks

R Cheng, G Orosz, RM Murray, JW Burdick - Proceedings of the AAAI …, 2019 - aaai.org
Reinforcement Learning (RL) algorithms have found limited success beyond simulated
applications, and one main reason is the absence of safety guarantees during the learning …

Recovery rl: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

NNV: the neural network verification tool for deep neural networks and learning-enabled cyber-physical systems

HD Tran, X Yang, D Manzanas Lopez, P Musau… - … on Computer Aided …, 2020 - Springer
This paper presents the Neural Network Verification (NNV) software tool, a set-based
verification framework for deep neural networks (DNNs) and learning-enabled cyber …

Constrained variational policy optimization for safe reinforcement learning

Z Liu, Z Cen, V Isenbaev, W Liu, S Wu… - International …, 2022 - proceedings.mlr.press
Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before
deploying them to safety-critical applications. Previous primal-dual style approaches suffer …

A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org
A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …