Deep reinforcement learning: A brief survey

K Arulkumaran, MP Deisenroth… - IEEE Signal …, 2017 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) is poised to revolutionize the field of artificial intelligence
(AI) and represents a step toward building autonomous systems with a higher-level …

A brief survey of deep reinforcement learning

K Arulkumaran, MP Deisenroth, M Brundage… - arXiv preprint arXiv …, 2017 - arxiv.org
Deep reinforcement learning is poised to revolutionise the field of AI and represents a step
towards building autonomous systems with a higher level understanding of the visual world …

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

The starcraft multi-agent challenge

M Samvelyan, T Rashid, CS De Witt… - arXiv preprint arXiv …, 2019 - arxiv.org
In the last few years, deep multi-agent reinforcement learning (RL) has become a highly
active area of research. A particularly challenging class of problems in this area is partially …

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

[图书][B] Synthetic data for deep learning

SI Nikolenko - 2021 - Springer
You are holding in your hands… oh, come on, who holds books like this in their hands
anymore? Anyway, you are reading this, and it means that I have managed to release one of …

Monotonic value function factorisation for deep multi-agent reinforcement learning

T Rashid, M Samvelyan, CS De Witt, G Farquhar… - Journal of Machine …, 2020 - jmlr.org
In many real-world settings, a team of agents must coordinate its behaviour while acting in a
decentralised fashion. At the same time, it is often possible to train the agents in a …

Counterfactual multi-agent policy gradients

J Foerster, G Farquhar, T Afouras, N Nardelli… - Proceedings of the …, 2018 - ojs.aaai.org
Many real-world problems, such as network packet routing and the coordination of
autonomous vehicles, are naturally modelled as cooperative multi-agent systems. There is a …

Starcraft ii: A new challenge for reinforcement learning

O Vinyals, T Ewalds, S Bartunov, P Georgiev… - arXiv preprint arXiv …, 2017 - arxiv.org
This paper introduces SC2LE (StarCraft II Learning Environment), a reinforcement learning
environment based on the StarCraft II game. This domain poses a new grand challenge for …

Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org
We give an overview of recent exciting achievements of deep reinforcement learning (RL).
We discuss six core elements, six important mechanisms, and twelve applications. We start …