A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

[HTML][HTML] Robot learning towards smart robotic manufacturing: A review

Z Liu, Q Liu, W Xu, L Wang, Z Zhou - Robotics and Computer-Integrated …, 2022 - Elsevier
Robotic equipment has been playing a central role since the proposal of smart
manufacturing. Since the beginning of the first integration of industrial robots into production …

COLREG-compliant collision avoidance for unmanned surface vehicle using deep reinforcement learning

E Meyer, A Heiberg, A Rasheed, O San - Ieee Access, 2020 - ieeexplore.ieee.org
Path Following and Collision Avoidance, be it for unmanned surface vessels or other
autonomous vehicles, are two fundamental guidance problems in robotics. For many …

Inverse reinforcement learning as the algorithmic basis for theory of mind: current methods and open problems

J Ruiz-Serra, MS Harré - Algorithms, 2023 - mdpi.com
Theory of mind (ToM) is the psychological construct by which we model another's internal
mental states. Through ToM, we adjust our own behaviour to best suit a social context, and …

LEMURS: Learning distributed multi-robot interactions

E Sebastián, T Duong, N Atanasov… - … on Robotics and …, 2023 - ieeexplore.ieee.org
This paper presents LEMURS, an algorithm for learning scalable multi-robot control policies
from cooperative task demonstrations. We propose a port-Hamiltonian description of the …

Memory-augmented Lyapunov-based safe reinforcement learning: end-to-end safety under uncertainty

AB Jeddi, NL Dehghani… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Despite recent advances in safe reinforcement learning (RL), safety constraints are often
violated at deployment, especially under extreme uncertainty in memory-based partially …

Semi-Supervised Imitation Learning of Team Policies from Suboptimal Demonstrations

S Seo, VV Unhelkar - arXiv preprint arXiv:2205.02959, 2022 - arxiv.org
We present Bayesian Team Imitation Learner (BTIL), an imitation learning algorithm to
model the behavior of teams performing sequential tasks in Markovian domains. In contrast …

[PDF][PDF] A hierarchical bayesian process for inverse rl in partially-controlled environments

K Bogert, P Doshi - AAMAS Conference proceedings, 2022 - par.nsf.gov
A known modality for imitation learning in robotics [15] is to employ a sensor suite to learn
from observing (LfO) the expert. The sensed data constitutes a trajectory, usually modeled …

Collective Anomaly Perception During Multi-Robot Patrol: Constrained Interactions Can Promote Accurate Consensus

ZR Madin, J Lawry, ER Hunt - Proceedings of the 39th ACM/SIGAPP …, 2024 - dl.acm.org
An important real-world application of multi-robot systems is multi-robot patrolling (MRP),
where robots must carry out the activity of going through an area at regular intervals …

On the impact of gravity compensation on reinforcement learning in goal-reaching tasks for robotic manipulators

J Fugal, J Bae, HA Poonawala - Robotics, 2021 - mdpi.com
Advances in machine learning technologies in recent years have facilitated developments in
autonomous robotic systems. Designing these autonomous systems typically requires …