Safe reinforcement learning via shielding

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

被引用次数：204 相关文章所有 2 个版本

[PDF] arxiv.org

Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org

The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

被引用次数：530 相关文章所有 9 个版本

[PDF] mlr.press

Do the rewards justify the means? measuring trade-offs between rewards and ethical behavior in the machiavelli benchmark

A Pan, JS Chan, A Zou, N Li, S Basart… - International …, 2023 - proceedings.mlr.press

Artificial agents have traditionally been trained to maximize reward, which may incentivize
power-seeking and deception, analogous to how next-token prediction in language models …

被引用次数：90 相关文章所有 6 个版本

[PDF] researchgate.net

A review of reinforcement learning based energy management systems for electrified powertrains: Progress, challenge, and potential solution

AH Ganesh, B Xu - Renewable and Sustainable Energy Reviews, 2022 - Elsevier

The impact of internal combustion engine-powered automobiles on climate change due to
emissions and the depletion of fossil fuels has contributed to the progress of electrified …

被引用次数：151 相关文章所有 5 个版本

[PDF] springer.com

A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G Jin, Y Dong… - Artificial Intelligence …, 2024 - Springer

Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

被引用次数：54 相关文章所有 6 个版本

[PDF] aaai.org

End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks

R Cheng, G Orosz, RM Murray, JW Burdick - Proceedings of the AAAI …, 2019 - aaai.org

Reinforcement Learning (RL) algorithms have found limited success beyond simulated
applications, and one main reason is the absence of safety guarantees during the learning …

被引用次数：640 相关文章所有 14 个版本

[PDF] arxiv.org

Recovery rl: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

被引用次数：212 相关文章所有 6 个版本

[PDF] springer.com

NNV: the neural network verification tool for deep neural networks and learning-enabled cyber-physical systems

HD Tran, X Yang, D Manzanas Lopez, P Musau… - … on Computer Aided …, 2020 - Springer

This paper presents the Neural Network Verification (NNV) software tool, a set-based
verification framework for deep neural networks (DNNs) and learning-enabled cyber …

被引用次数：281 相关文章所有 19 个版本

[PDF] mlr.press

Constrained variational policy optimization for safe reinforcement learning

Z Liu, Z Cen, V Isenbaev, W Liu, S Wu… - International …, 2022 - proceedings.mlr.press

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before
deploying them to safety-critical applications. Previous primal-dual style approaches suffer …

被引用次数：69 相关文章所有 6 个版本

[PDF] jmlr.org

A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org

A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

被引用次数：354 相关文章所有 18 个版本

高级搜索

QQ 群