A unified game-theoretic approach to multiagent reinforcement learning

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc
There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …

A policy gradient algorithm for learning to learn in multiagent reinforcement learning

DK Kim, M Liu, MD Riemer, C Sun… - International …, 2021 - proceedings.mlr.press
A fundamental challenge in multiagent reinforcement learning is to learn beneficial
behaviors in a shared environment with other simultaneously learning agents. In particular …

Multi-agent learning with policy prediction

C Zhang, V Lesser - Proceedings of the AAAI Conference on Artificial …, 2010 - ojs.aaai.org
Due to the non-stationary environment, learning in multi-agent systems is a challenging
problem. This paper first introduces a new gradient-based learning algorithm, augmenting …

Dealing with non-stationarity in multi-agent deep reinforcement learning

G Papoudakis, F Christianos, A Rahman… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent developments in deep reinforcement learning are concerned with creating decision-
making agents which can perform well in various complex domains. A particular approach …

Game theory and multi-agent reinforcement learning

A Nowé, P Vrancx, YM De Hauwere - Reinforcement Learning: State-of …, 2012 - Springer
Reinforcement Learning was originally developed for Markov Decision Processes (MDPs). It
allows a single agent to learn a policy that maximizes a possibly delayed reward signal in a …

[PDF][PDF] Potential-based difference rewards for multiagent reinforcement learning

S Devlin, L Yliniemi, D Kudenko… - Proceedings of the 2014 …, 2014 - aamas.csc.liv.ac.uk
Difference rewards and potential-based reward shaping can both significantly improve the
joint policy learnt by multiple reinforcement learning agents acting simultaneously in the …

Prediction-based multi-agent reinforcement learning in inherently non-stationary environments

A Marinescu, I Dusparic, S Clarke - ACM Transactions on Autonomous …, 2017 - dl.acm.org
Multi-agent reinforcement learning (MARL) is a widely researched technique for
decentralised control in complex large-scale autonomous systems. Such systems often …

[PDF][PDF] Comparative evaluation of cooperative multi-agent deep reinforcement learning algorithms

G Papoudakis, F Christianos, L Schäfer… - arXiv preprint arXiv …, 2020 - ala2021.vub.ac.be
Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used
evaluation tasks and criteria, making comparisons between approaches difficult. In this work …

Deep reinforcement learning

M Krichen - 2023 14th International Conference on Computing …, 2023 - ieeexplore.ieee.org
Deep Reinforcement Learning (DRL) is a powerful technique for learning policies for
complex decision-making tasks. In this paper, we provide an overview of DRL, including its …

Deep multiagent reinforcement learning: Challenges and directions

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer
This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …