Maven: Multi-agent variational exploration

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc
Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

Mdp homomorphic networks: Group symmetries in reinforcement learning

E Van der Pol, D Worrall, H van Hoof… - Advances in …, 2020 - proceedings.neurips.cc
This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP
homomorphic networks are neural networks that are equivariant under symmetries in the …

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press
VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

Continuous mdp homomorphisms and homomorphic policy gradient

S Rezaei-Shoshtari, R Zhao… - Advances in …, 2022 - proceedings.neurips.cc
Abstraction has been widely studied as a way to improve the efficiency and generalization of
reinforcement learning algorithms. In this paper, we study abstraction in the continuous …

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Tesseract: Tensorised actors for multi-agent reinforcement learning

A Mahajan, M Samvelyan, L Mao… - International …, 2021 - proceedings.mlr.press
Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …

Virel: A variational inference framework for reinforcement learning

M Fellows, A Mahajan, TGJ Rudner… - Advances in neural …, 2019 - proceedings.neurips.cc
Applying probabilistic models to reinforcement learning (RL) enables the uses of powerful
optimisation tools such as variational inference in RL. However, existing inference …

Efficient bimanual handover and rearrangement via symmetry-aware actor-critic learning

Y Li, C Pan, H Xu, X Wang, Y Wu - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Bimanual manipulation is important for building intelligent robots that unlock richer skills
than single arms. We consider a multi-object bimanual rearrangement task, where a …

Invariant transform experience replay: Data augmentation for deep reinforcement learning

Y Lin, J Huang, M Zimmer, Y Guan… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org
Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but
its current application to robotics is currently hindered by high sample requirements. To …