Inequity aversion improves cooperation in intertemporal social dilemmas

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

被引用次数：200 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1345 相关文章所有 8 个版本

[PDF] springer.com

Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer

The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

被引用次数：511 相关文章所有 8 个版本

[PDF] neurips.cc

Pettingzoo: Gym for multi-agent reinforcement learning

J Terry, B Black, N Grammel… - Advances in …, 2021 - proceedings.neurips.cc

This paper introduces the PettingZoo library and the accompanying Agent Environment
Cycle (" AEC") games model. PettingZoo is a library of diverse sets of multi-agent …

被引用次数：286 相关文章所有 12 个版本

[PDF] neurips.cc

Collaborating with humans without human data

DJ Strouse, K McKee, M Botvinick… - Advances in …, 2021 - proceedings.neurips.cc

Collaborating with humans requires rapidly adapting to their individual strengths,
weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement …

被引用次数：145 相关文章所有 6 个版本

[PDF] neurips.cc

On the utility of learning about humans for human-ai coordination

M Carroll, R Shah, MK Ho, T Griffiths… - Advances in neural …, 2019 - proceedings.neurips.cc

While we would like agents that can coordinate with humans, current algorithms such as self-
play and population-based training create agents that can coordinate with themselves …

被引用次数：353 相关文章所有 12 个版本

[PDF] arxiv.org

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org

Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

被引用次数：195 相关文章所有 5 个版本

[PDF] mlr.press

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

N Jaques, A Lazaridou, E Hughes… - International …, 2019 - proceedings.mlr.press

We propose a unified mechanism for achieving coordination and communication in Multi-
Agent Reinforcement Learning (MARL), through rewarding agents for having causal …

被引用次数：482 相关文章所有 6 个版本

[PDF] mlr.press

Scalable evaluation of multi-agent reinforcement learning with melting pot

JZ Leibo, EA Dueñez-Guzman… - International …, 2021 - proceedings.mlr.press

Existing evaluation suites for multi-agent reinforcement learning (MARL) do not assess
generalization to novel situations as their primary objective (unlike supervised learning …

被引用次数：81 相关文章所有 6 个版本

[PDF] mlr.press

Fop: Factorizing optimal joint policy of maximum-entropy multi-agent reinforcement learning

T Zhang, Y Li, C Wang, G Xie… - … conference on machine …, 2021 - proceedings.mlr.press

Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods.
However, existing decomposed actor-critic methods cannot guarantee the convergence of …

被引用次数：77 相关文章所有 3 个版本

高级搜索

QQ 群