Maven: Multi-agent variational exploration

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc
Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press
VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

Tesseract: Tensorised actors for multi-agent reinforcement learning

A Mahajan, M Samvelyan, L Mao… - International …, 2021 - proceedings.mlr.press
Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …

Virel: A variational inference framework for reinforcement learning

M Fellows, A Mahajan, TGJ Rudner… - Advances in neural …, 2019 - proceedings.neurips.cc
Applying probabilistic models to reinforcement learning (RL) enables the uses of powerful
optimisation tools such as variational inference in RL. However, existing inference …

Invariant transform experience replay: Data augmentation for deep reinforcement learning

Y Lin, J Huang, M Zimmer, Y Guan… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org
Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but
its current application to robotics is currently hindered by high sample requirements. To …

[HTML][HTML] Addressing imperfect symmetry: a novel symmetry-learning actor-critic extension

M Abreu, LP Reis, N Lau - Neurocomputing, 2025 - Elsevier
Symmetry, a fundamental concept to understand our environment, often oversimplifies
reality from a mathematical perspective. Humans are a prime example, deviating from …

Knowledge-guided exploration in deep reinforcement learning

S Mazumder, B Liu, S Wang, Y Zhu, X Yin… - arXiv preprint arXiv …, 2022 - arxiv.org
This paper proposes a new method to drastically speed up deep reinforcement learning
(deep RL) training for problems that have the property of state-action permissibility (SAP) …

[PDF][PDF] Action permissibility in deep reinforcement learning and application to autonomous driving

S Mazumder, B Liu, S Wang, Y Zhu, L Liu… - KDD'18 Deep Learning …, 2018 - kdd.org
This paper is concerned with deep reinforcement learning (deep RL) in continuous state and
action space. It proposes a new method that can drastically speed up RL training for …

Generalization Across Observation Shifts in Reinforcement Learning

A Mahajan, A Zhang - arXiv preprint arXiv:2306.04595, 2023 - arxiv.org
Learning policies which are robust to changes in the environment are critical for real world
deployment of Reinforcement Learning agents. They are also necessary for achieving good …