BAFFLE: Backdoor Attack in Offline Reinforcement Learning

C Gong, Z Yang, Y Bai, J He, J Shi, K Li… - arXiv preprint arXiv …, 2022 - arxiv.org
A growing body of research has focused on the Reinforcement Learning (RL) methods
which allow the agent to learn from trial-and-error experiences gathered during the …

BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets

C Gong, Z Yang, Y Bai, J Shi, J He, K Li… - 2024 IEEE Symposium …, 2024 - computer.org
Reinforcement learning (RL) makes an agent learn from trial-and-error experiences
gathered during the interaction with the environment. Recently, offline RL has become a …

Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses

T Nguyen, TM Luu, T Ton, CD Yoo - arXiv preprint arXiv:2405.11206, 2024 - arxiv.org
Offline reinforcement learning (RL) addresses the challenge of expensive and high-risk data
exploration inherent in RL by pre-training policies on vast amounts of offline data, enabling …

Reward Poisoning Attack Against Offline Reinforcement Learning

Y Xu, R Gumaste, G Singh - arXiv preprint arXiv:2402.09695, 2024 - arxiv.org
We study the problem of reward poisoning attacks against general offline reinforcement
learning with deep neural networks for function approximation. We consider a black-box …

Offline Reward Perturbation Boosts Distributional Shift in Online RL

Z Yu, S Kang, X Zhang - The 40th Conference on Uncertainty in Artificial … - openreview.net
Offline-to-online reinforcement learning has recently been shown effective in reducing the
online sample complexity by first training from offline collected data. However, this additional …

Copa: Certifying robust policies for offline reinforcement learning against poisoning attacks

F Wu, L Li, C Xu, H Zhang, B Kailkhura… - arXiv preprint arXiv …, 2022 - arxiv.org
As reinforcement learning (RL) has achieved near human-level performance in a variety of
tasks, its robustness has raised great attention. While a vast body of research has explored …

Robust Offline Reinforcement Learning--Certify the Confidence Interval

J Yao, SS Du - arXiv preprint arXiv:2309.16631, 2023 - arxiv.org
Currently, reinforcement learning (RL), especially deep RL, has received more and more
attention in the research area. However, the security of RL has been an obvious problem …

MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator

XY Liu, XH Zhou, GT Li, H Li, MJ Gui, TY Xiang… - arXiv preprint arXiv …, 2023 - arxiv.org
Offline reinforcement learning (RL) faces a significant challenge of distribution shift. Model-
free offline RL penalizes the Q value for out-of-distribution (OOD) data or constrains the …

A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

Guard: A safe reinforcement learning benchmark

W Zhao, R Chen, Y Sun, R Liu, T Wei, C Liu - arXiv preprint arXiv …, 2023 - arxiv.org
Due to the trial-and-error nature, it is typically challenging to apply RL algorithms to safety-
critical real-world applications, such as autonomous driving, human-robot interaction, robot …