A unified game-theoretic approach to multiagent reinforcement learning

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

被引用次数：292 相关文章所有 2 个版本

[PDF] arxiv.org

Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications

TT Nguyen, ND Nguyen… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

Reinforcement learning (RL) algorithms have been around for decades and employed to
solve various sequential decision-making problems. These algorithms, however, have faced …

被引用次数：1005 相关文章所有 12 个版本

[PDF] arxiv.org

Mastering the game of Stratego with model-free multiagent reinforcement learning

J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub… - Science, 2022 - science.org

We introduce DeepNash, an autonomous agent that plays the imperfect information game
Stratego at a human expert level. Stratego is one of the few iconic board games that artificial …

被引用次数：174 相关文章所有 6 个版本

[HTML] science.org

Learning agile soccer skills for a bipedal robot with deep reinforcement learning

T Haarnoja, B Moran, G Lever, SH Huang… - Science Robotics, 2024 - science.org

We investigated whether deep reinforcement learning (deep RL) is able to synthesize
sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be …

被引用次数：58 相关文章所有 7 个版本

[HTML] springer.com

[HTML][HTML] Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer

The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

被引用次数：527 相关文章所有 8 个版本

[PDF] arxiv.org

Self-play fine-tuning converts weak language models to strong language models

Z Chen, Y Deng, H Yuan, K Ji, Q Gu - arXiv preprint arXiv:2401.01335, 2024 - arxiv.org

Harnessing the power of human-annotated data through Supervised Fine-Tuning (SFT) is
pivotal for advancing Large Language Models (LLMs). In this paper, we delve into the …

被引用次数：94 相关文章所有 3 个版本

Grandmaster level in StarCraft II using multi-agent reinforcement learning

O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu… - nature, 2019 - nature.com

Many real-world applications require artificial agents to compete and coordinate with other
agents in complex environments. As a stepping stone to this goal, the domain of StarCraft …

被引用次数：4449 相关文章所有 11 个版本

[PDF] neurips.cc

Collaborating with humans without human data

DJ Strouse, K McKee, M Botvinick… - Advances in …, 2021 - proceedings.neurips.cc

Collaborating with humans requires rapidly adapting to their individual strengths,
weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement …

被引用次数：146 相关文章所有 6 个版本

[PDF] ieee.org

Survey of deep reinforcement learning for motion planning of autonomous vehicles

S Aradi - IEEE Transactions on Intelligent Transportation …, 2020 - ieeexplore.ieee.org

Academic research in the field of autonomous vehicles has reached high popularity in
recent years related to several topics as sensor technologies, V2X communications, safety …

被引用次数：461 相关文章所有 9 个版本

[HTML] science.org

Human-level performance in 3D multiplayer games with population-based reinforcement learning

M Jaderberg, WM Czarnecki, I Dunning, L Marris… - Science, 2019 - science.org

Reinforcement learning (RL) has shown great success in increasingly complex single-agent
environments and two-player turn-based games. However, the real world contains multiple …

被引用次数：947 相关文章所有 12 个版本

高级搜索

QQ 群