A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer
The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Pettingzoo: Gym for multi-agent reinforcement learning

J Terry, B Black, N Grammel… - Advances in …, 2021 - proceedings.neurips.cc
This paper introduces the PettingZoo library and the accompanying Agent Environment
Cycle (" AEC") games model. PettingZoo is a library of diverse sets of multi-agent …

Collaborating with humans without human data

DJ Strouse, K McKee, M Botvinick… - Advances in …, 2021 - proceedings.neurips.cc
Collaborating with humans requires rapidly adapting to their individual strengths,
weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement …

On the utility of learning about humans for human-ai coordination

M Carroll, R Shah, MK Ho, T Griffiths… - Advances in neural …, 2019 - proceedings.neurips.cc
While we would like agents that can coordinate with humans, current algorithms such as self-
play and population-based training create agents that can coordinate with themselves …

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

N Jaques, A Lazaridou, E Hughes… - International …, 2019 - proceedings.mlr.press
We propose a unified mechanism for achieving coordination and communication in Multi-
Agent Reinforcement Learning (MARL), through rewarding agents for having causal …

Scalable evaluation of multi-agent reinforcement learning with melting pot

JZ Leibo, EA Dueñez-Guzman… - International …, 2021 - proceedings.mlr.press
Existing evaluation suites for multi-agent reinforcement learning (MARL) do not assess
generalization to novel situations as their primary objective (unlike supervised learning …

Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning

T Zhang, Y Li, C Wang, G Xie… - … conference on machine …, 2021 - proceedings.mlr.press
Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods.
However, existing decomposed actor-critic methods cannot guarantee the convergence of …