Deep interactive bayesian reinforcement learning via meta-learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org

While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

被引用次数：165 相关文章所有 2 个版本

[图书][B] Multi-agent reinforcement learning: Foundations and modern approaches

SV Albrecht, F Christianos, L Schäfer - 2024 - books.google.com

The first comprehensive introduction to Multi-Agent Reinforcement Learning (MARL),
covering MARL's models, solution concepts, algorithmic ideas, technical challenges, and …

被引用次数：130 相关文章

[PDF] neurips.cc

Learning to ground multi-agent communication with autoencoders

T Lin, J Huh, C Stauffer, SN Lim… - Advances in Neural …, 2021 - proceedings.neurips.cc

Communication requires having a common language, a lingua franca, between agents. This
language could emerge via a consensus process, but it may require many generations of …

被引用次数：64 相关文章所有 9 个版本

[PDF] arxiv.org

A survey of ad hoc teamwork research

R Mirsky, I Carlucho, A Rahman, E Fosong… - European conference on …, 2022 - Springer

Ad hoc teamwork is the research problem of designing agents that can collaborate with new
teammates without prior coordination. This survey makes a two-fold contribution: First, it …

被引用次数：48 相关文章所有 12 个版本

[PDF] neurips.cc

Agent modelling under partial observability for deep reinforcement learning

G Papoudakis, F Christianos… - Advances in Neural …, 2021 - proceedings.neurips.cc

Modelling the behaviours of other agents is essential for understanding how agents interact
and making effective decisions. Existing methods for agent modelling commonly assume …

被引用次数：66 相关文章所有 7 个版本

[PDF] academia.edu

[PDF][PDF] A survey of ad hoc teamwork: Definitions, methods, and open problems

R Mirsky, I Carlucho, A Rahman, E Fosong… - … on Multiagent Systems, 2022 - academia.edu

Ad hoc teamwork is the well-established research problem of designing agents that can
collaborate with new teammates without prior coordination. This survey makes a two-fold …

被引用次数：33 相关文章所有 3 个版本

[PDF] aaai.org

Minimum coverage sets for training robust ad hoc teamwork agents

M Rahman, J Cui, P Stone - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Robustly cooperating with unseen agents and human partners presents significant
challenges due to the diverse cooperative conventions these partners may adopt. Existing …

被引用次数：7 相关文章所有 7 个版本

[PDF] mlr.press

Controlling type confounding in ad hoc teamwork with instance-wise teammate feedback rectification

D Xing, P Gu, Q Zheng, X Wang, S Liu… - International …, 2023 - proceedings.mlr.press

Ad hoc teamwork requires an agent to cooperate with unknown teammates without prior
coordination. Many works propose to abstract teammate instances into high-level …

被引用次数：2 相关文章所有 7 个版本

[PDF] aaai.org

Back to the future: Toward a hybrid architecture for ad hoc teamwork

H Dodampegama, M Sridharan - … of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

State of the art methods for ad hoc teamwork, ie, for collaboration without prior coordination,
often use a long history of prior observations to model the behavior of other agents (or agent …

被引用次数：9 相关文章所有 7 个版本

[PDF] ijcai.org

[PDF][PDF] Dynamic Belief for Decentralized Multi-Agent Cooperative Learning.

Y Zhai, P Peng, C Su, Y Tian - IJCAI, 2023 - ijcai.org

Decentralized multi-agent cooperative learning is a practical task due to the partially
observed setting both in training and execution. Every agent learns to cooperate without …

被引用次数：1 相关文章所有 2 个版本

高级搜索

QQ 群