Emergent communication through negotiation

A Lazaridou, M Baroni - arXiv preprint arXiv:2006.02419, 2020 - arxiv.org

The ability to cooperate through language is a defining feature of humans. As the
perceptual, motory and planning capabilities of deep artificial networks increase …

被引用次数：184 相关文章所有 3 个版本

[HTML] springer.com

[HTML][HTML] Deep multiagent reinforcement learning: Challenges and directions

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer

This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …

被引用次数：85 相关文章所有 8 个版本

[HTML] springer.com

[HTML][HTML] Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer

The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

被引用次数：554 相关文章所有 8 个版本

[PDF] mlr.press

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

N Jaques, A Lazaridou, E Hughes… - International …, 2019 - proceedings.mlr.press

We propose a unified mechanism for achieving coordination and communication in Multi-
Agent Reinforcement Learning (MARL), through rewarding agents for having causal …

被引用次数：502 相关文章所有 6 个版本

[HTML] sciencedirect.com

[HTML][HTML] The hanabi challenge: A new frontier for ai research

N Bard, JN Foerster, S Chandar, N Burch, M Lanctot… - Artificial Intelligence, 2020 - Elsevier

From the early days of computing, games have been important testbeds for studying how
well machines can do sophisticated decision making. In recent years, machine learning has …

被引用次数：413 相关文章所有 9 个版本

[PDF] arxiv.org

OpenSpiel: A framework for reinforcement learning in games

M Lanctot, E Lockhart, JB Lespiau, V Zambaldi… - arXiv preprint arXiv …, 2019 - arxiv.org

OpenSpiel is a collection of environments and algorithms for research in general
reinforcement learning and search/planning in games. OpenSpiel supports n-player (single …

被引用次数：263 相关文章所有 6 个版本

[PDF] arxiv.org

Multi-step retriever-reader interaction for scalable open-domain question answering

R Das, S Dhuliawala, M Zaheer… - arXiv preprint arXiv …, 2019 - arxiv.org

This paper introduces a new framework for open-domain question answering in which the
retriever and the reader iteratively interact with each other. The framework is agnostic to the …

被引用次数：211 相关文章所有 4 个版本

[PDF] openreview.net

Emergent communication at scale

R Chaabouni, F Strub, F Altché, E Tarassov… - International …, 2022 - openreview.net

Emergent communication aims for a better understanding of human language evolution and
building more efficient representations. We posit that reaching these goals will require …

被引用次数：60 相关文章

[PDF] neurips.cc

Actor-critic policy optimization in partially observable multiagent environments

S Srinivasan, M Lanctot, V Zambaldi… - Advances in neural …, 2018 - proceedings.neurips.cc

Optimization of parameterized policies for reinforcement learning (RL) is an important and
challenging problem in artificial intelligence. Among the most common approaches are …

被引用次数：164 相关文章所有 9 个版本

[PDF] arxiv.org

Decoupling strategy and generation in negotiation dialogues

H He, D Chen, A Balakrishnan, P Liang - arXiv preprint arXiv:1808.09637, 2018 - arxiv.org

We consider negotiation settings in which two agents use natural language to bargain on
goods. Agents need to decide on both high-level strategy (eg, proposing\$50) and the …

被引用次数：161 相关文章所有 6 个版本

高级搜索

QQ 群