Benchmarking multi-agent deep reinforcement learning algorithms in cooperative tasks

G Papoudakis, F Christianos, L Schäfer… - arXiv preprint arXiv …, 2020 - arxiv.org
Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used
evaluation tasks and criteria, making comparisons between approaches difficult. In this work …

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Unsupervised representation learning in deep reinforcement learning: A review

N Botteghi, M Poel, C Brune - arXiv preprint arXiv:2208.14226, 2022 - arxiv.org
This review addresses the problem of learning abstract representations of the measurement
data in the context of Deep Reinforcement Learning (DRL). While the data are often …

Madiff: Offline multi-agent learning with diffusion models

Z Zhu, M Liu, L Mao, B Kang, M Xu, Y Yu… - arXiv preprint arXiv …, 2023 - arxiv.org
Diffusion model (DM) recently achieved huge success in various scenarios including offline
reinforcement learning, where the diffusion planner learn to generate desired trajectories …

A survey of ad hoc teamwork research

R Mirsky, I Carlucho, A Rahman, E Fosong… - European conference on …, 2022 - Springer
Ad hoc teamwork is the research problem of designing agents that can collaborate with new
teammates without prior coordination. This survey makes a two-fold contribution: First, it …

[PDF][PDF] A survey of ad hoc teamwork: Definitions, methods, and open problems

R Mirsky, I Carlucho, A Rahman, E Fosong… - … on Multiagent Systems, 2022 - academia.edu
Ad hoc teamwork is the well-established research problem of designing agents that can
collaborate with new teammates without prior coordination. This survey makes a two-fold …

Minimum coverage sets for training robust ad hoc teamwork agents

M Rahman, J Cui, P Stone - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Robustly cooperating with unseen agents and human partners presents significant
challenges due to the diverse cooperative conventions these partners may adopt. Existing …

Generating diverse cooperative agents by learning incompatible policies

R Charakorn, P Manoonpong… - … Conference on Learning …, 2023 - openreview.net
Training a robust cooperative agent requires diverse partner agents. However, obtaining
those agents is difficult. Previous works aim to learn diverse behaviors by changing the state …

面向多智能体博弈对抗的对手建模框架

罗俊仁, 张万鹏, 袁唯淋, 胡振震, 陈少飞… - 系统仿真学报, 2022 - china-simulation.com
对手建模作为多智能体博弈对抗的关键技术, 是一种典型的智能体认知行为建模方法.
介绍了多智能体博弈对抗几类典型模型, 非平稳问题和元博弈相关理论; 梳理总结对手建模方法 …

LINDA: Multi-agent local information decomposition for awareness of teammates

J Cao, L Yuan, J Wang, S Zhang, C Zhang, Y Yu… - Science China …, 2023 - Springer
In cooperative multi-agent reinforcement learning (MARL), where agents only have access
to partial observations, efficiently leveraging local information is critical. During long-time …