Symmetry learning for function approximation in reinforcement learning

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc

Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

被引用次数：442 相关文章所有 11 个版本

[PDF] arxiv.org

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org

Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

被引用次数：231 相关文章所有 5 个版本

[PDF] neurips.cc

Mdp homomorphic networks: Group symmetries in reinforcement learning

E Van der Pol, D Worrall, H van Hoof… - Advances in …, 2020 - proceedings.neurips.cc

This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP
homomorphic networks are neural networks that are equivariant under symmetries in the …

被引用次数：170 相关文章所有 13 个版本

[PDF] mlr.press

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press

VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

被引用次数：55 相关文章所有 11 个版本

[PDF] neurips.cc

Continuous mdp homomorphisms and homomorphic policy gradient

S Rezaei-Shoshtari, R Zhao… - Advances in …, 2022 - proceedings.neurips.cc

Abstraction has been widely studied as a way to improve the efficiency and generalization of
reinforcement learning algorithms. In this paper, we study abstraction in the continuous …

被引用次数：24 相关文章所有 6 个版本

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

被引用次数：17 相关文章所有 2 个版本

[PDF] mlr.press

Tesseract: Tensorised actors for multi-agent reinforcement learning

A Mahajan, M Samvelyan, L Mao… - International …, 2021 - proceedings.mlr.press

Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …

被引用次数：42 相关文章所有 10 个版本

[PDF] neurips.cc

Virel: A variational inference framework for reinforcement learning

M Fellows, A Mahajan, TGJ Rudner… - Advances in neural …, 2019 - proceedings.neurips.cc

Applying probabilistic models to reinforcement learning (RL) enables the uses of powerful
optimisation tools such as variational inference in RL. However, existing inference …

被引用次数：62 相关文章所有 15 个版本

[PDF] sqz.ac.cn

Efficient bimanual handover and rearrangement via symmetry-aware actor-critic learning

Y Li, C Pan, H Xu, X Wang, Y Wu - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Bimanual manipulation is important for building intelligent robots that unlock richer skills
than single arms. We consider a multi-object bimanual rearrangement task, where a …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Invariant transform experience replay: Data augmentation for deep reinforcement learning

Y Lin, J Huang, M Zimmer, Y Guan… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org

Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but
its current application to robotics is currently hindered by high sample requirements. To …

被引用次数：46 相关文章所有 7 个版本

高级搜索

QQ 群