A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

[图书][B] Multi-agent reinforcement learning: Foundations and modern approaches

SV Albrecht, F Christianos, L Schäfer - 2024 - books.google.com
The first comprehensive introduction to Multi-Agent Reinforcement Learning (MARL),
covering MARL's models, solution concepts, algorithmic ideas, technical challenges, and …

Learning to ground multi-agent communication with autoencoders

T Lin, J Huh, C Stauffer, SN Lim… - Advances in Neural …, 2021 - proceedings.neurips.cc
Communication requires having a common language, a lingua franca, between agents. This
language could emerge via a consensus process, but it may require many generations of …

A survey of ad hoc teamwork research

R Mirsky, I Carlucho, A Rahman, E Fosong… - European conference on …, 2022 - Springer
Ad hoc teamwork is the research problem of designing agents that can collaborate with new
teammates without prior coordination. This survey makes a two-fold contribution: First, it …

Agent modelling under partial observability for deep reinforcement learning

G Papoudakis, F Christianos… - Advances in Neural …, 2021 - proceedings.neurips.cc
Modelling the behaviours of other agents is essential for understanding how agents interact
and making effective decisions. Existing methods for agent modelling commonly assume …

[PDF][PDF] A survey of ad hoc teamwork: Definitions, methods, and open problems

R Mirsky, I Carlucho, A Rahman, E Fosong… - … on Multiagent Systems, 2022 - academia.edu
Ad hoc teamwork is the well-established research problem of designing agents that can
collaborate with new teammates without prior coordination. This survey makes a two-fold …

Minimum coverage sets for training robust ad hoc teamwork agents

M Rahman, J Cui, P Stone - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Robustly cooperating with unseen agents and human partners presents significant
challenges due to the diverse cooperative conventions these partners may adopt. Existing …

Controlling type confounding in ad hoc teamwork with instance-wise teammate feedback rectification

D Xing, P Gu, Q Zheng, X Wang, S Liu… - International …, 2023 - proceedings.mlr.press
Ad hoc teamwork requires an agent to cooperate with unknown teammates without prior
coordination. Many works propose to abstract teammate instances into high-level …

Back to the future: Toward a hybrid architecture for ad hoc teamwork

H Dodampegama, M Sridharan - … of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
State of the art methods for ad hoc teamwork, ie, for collaboration without prior coordination,
often use a long history of prior observations to model the behavior of other agents (or agent …

[PDF][PDF] Dynamic Belief for Decentralized Multi-Agent Cooperative Learning.

Y Zhai, P Peng, C Su, Y Tian - IJCAI, 2023 - ijcai.org
Decentralized multi-agent cooperative learning is a practical task due to the partially
observed setting both in training and execution. Every agent learns to cooperate without …