Symmetry detection and exploitation for function approximation in deep rl

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc

Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

被引用次数：442 相关文章所有 11 个版本

[PDF] arxiv.org

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org

Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

被引用次数：231 相关文章所有 5 个版本

[PDF] mlr.press

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press

VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

被引用次数：55 相关文章所有 11 个版本

[PDF] mlr.press

Tesseract: Tensorised actors for multi-agent reinforcement learning

A Mahajan, M Samvelyan, L Mao… - International …, 2021 - proceedings.mlr.press

Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …

被引用次数：42 相关文章所有 10 个版本

[PDF] neurips.cc

Virel: A variational inference framework for reinforcement learning

M Fellows, A Mahajan, TGJ Rudner… - Advances in neural …, 2019 - proceedings.neurips.cc

Applying probabilistic models to reinforcement learning (RL) enables the uses of powerful
optimisation tools such as variational inference in RL. However, existing inference …

被引用次数：62 相关文章所有 15 个版本

[PDF] arxiv.org

Invariant transform experience replay: Data augmentation for deep reinforcement learning

Y Lin, J Huang, M Zimmer, Y Guan… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org

Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but
its current application to robotics is currently hindered by high sample requirements. To …

被引用次数：46 相关文章所有 7 个版本

[HTML] sciencedirect.com

[HTML][HTML] Addressing imperfect symmetry: a novel symmetry-learning actor-critic extension

M Abreu, LP Reis, N Lau - Neurocomputing, 2025 - Elsevier

Symmetry, a fundamental concept to understand our environment, often oversimplifies
reality from a mathematical perspective. Humans are a prime example, deviating from …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Knowledge-guided exploration in deep reinforcement learning

S Mazumder, B Liu, S Wang, Y Zhu, X Yin… - arXiv preprint arXiv …, 2022 - arxiv.org

This paper proposes a new method to drastically speed up deep reinforcement learning
(deep RL) training for problems that have the property of state-action permissibility (SAP) …

被引用次数：3 相关文章所有 2 个版本

[PDF] kdd.org

[PDF][PDF] Action permissibility in deep reinforcement learning and application to autonomous driving

S Mazumder, B Liu, S Wang, Y Zhu, L Liu… - KDD'18 Deep Learning …, 2018 - kdd.org

This paper is concerned with deep reinforcement learning (deep RL) in continuous state and
action space. It proposes a new method that can drastically speed up RL training for …

被引用次数：9 相关文章

[PDF] arxiv.org

Generalization Across Observation Shifts in Reinforcement Learning

A Mahajan, A Zhang - arXiv preprint arXiv:2306.04595, 2023 - arxiv.org

Learning policies which are robust to changes in the environment are critical for real world
deployment of Reinforcement Learning agents. They are also necessary for achieving good …

高级搜索

QQ 群