Emergent multi-agent communication in the deep learning era

A Lazaridou, M Baroni - arXiv preprint arXiv:2006.02419, 2020 - arxiv.org
The ability to cooperate through language is a defining feature of humans. As the
perceptual, motory and planning capabilities of deep artificial networks increase …

[HTML][HTML] Deep multiagent reinforcement learning: Challenges and directions

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer
This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …

[HTML][HTML] Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer
The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

N Jaques, A Lazaridou, E Hughes… - International …, 2019 - proceedings.mlr.press
We propose a unified mechanism for achieving coordination and communication in Multi-
Agent Reinforcement Learning (MARL), through rewarding agents for having causal …

[HTML][HTML] The hanabi challenge: A new frontier for ai research

N Bard, JN Foerster, S Chandar, N Burch, M Lanctot… - Artificial Intelligence, 2020 - Elsevier
From the early days of computing, games have been important testbeds for studying how
well machines can do sophisticated decision making. In recent years, machine learning has …

OpenSpiel: A framework for reinforcement learning in games

M Lanctot, E Lockhart, JB Lespiau, V Zambaldi… - arXiv preprint arXiv …, 2019 - arxiv.org
OpenSpiel is a collection of environments and algorithms for research in general
reinforcement learning and search/planning in games. OpenSpiel supports n-player (single …

Multi-step retriever-reader interaction for scalable open-domain question answering

R Das, S Dhuliawala, M Zaheer… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper introduces a new framework for open-domain question answering in which the
retriever and the reader iteratively interact with each other. The framework is agnostic to the …

Emergent communication at scale

R Chaabouni, F Strub, F Altché, E Tarassov… - International …, 2022 - openreview.net
Emergent communication aims for a better understanding of human language evolution and
building more efficient representations. We posit that reaching these goals will require …

Actor-critic policy optimization in partially observable multiagent environments

S Srinivasan, M Lanctot, V Zambaldi… - Advances in neural …, 2018 - proceedings.neurips.cc
Optimization of parameterized policies for reinforcement learning (RL) is an important and
challenging problem in artificial intelligence. Among the most common approaches are …

Decoupling strategy and generation in negotiation dialogues

H He, D Chen, A Balakrishnan, P Liang - arXiv preprint arXiv:1808.09637, 2018 - arxiv.org
We consider negotiation settings in which two agents use natural language to bargain on
goods. Agents need to decide on both high-level strategy (eg, proposing\$50) and the …