相关文章- 学术资源搜索

A unified game-theoretic approach to multiagent reinforcement learning

M Lanctot, V Zambaldi, A Gruslys… - Advances in neural …, 2017 - proceedings.neurips.cc

There has been a resurgence of interest in multiagent reinforcement learning (MARL), due
partly to the recent success of deep neural networks. The simplest form of MARL is …

被引用次数：742 相关文章所有 15 个版本

[PDF] mlr.press

A policy gradient algorithm for learning to learn in multiagent reinforcement learning

DK Kim, M Liu, MD Riemer, C Sun… - International …, 2021 - proceedings.mlr.press

A fundamental challenge in multiagent reinforcement learning is to learn beneficial
behaviors in a shared environment with other simultaneously learning agents. In particular …

被引用次数：61 相关文章所有 9 个版本

[PDF] aaai.org

Multi-agent learning with policy prediction

C Zhang, V Lesser - Proceedings of the AAAI Conference on Artificial …, 2010 - ojs.aaai.org

Due to the non-stationary environment, learning in multi-agent systems is a challenging
problem. This paper first introduces a new gradient-based learning algorithm, augmenting …

被引用次数：171 相关文章所有 16 个版本

[PDF] arxiv.org

Dealing with non-stationarity in multi-agent deep reinforcement learning

G Papoudakis, F Christianos, A Rahman… - arXiv preprint arXiv …, 2019 - arxiv.org

Recent developments in deep reinforcement learning are concerned with creating decision-
making agents which can perform well in various complex domains. A particular approach …

被引用次数：218 相关文章所有 2 个版本

[PDF] academia.edu

Game theory and multi-agent reinforcement learning

A Nowé, P Vrancx, YM De Hauwere - Reinforcement Learning: State-of …, 2012 - Springer

Reinforcement Learning was originally developed for Markov Decision Processes (MDPs). It
allows a single agent to learn a policy that maximizes a possibly delayed reward signal in a …

被引用次数：302 相关文章所有 6 个版本

[PDF] liv.ac.uk

[PDF][PDF] Potential-based difference rewards for multiagent reinforcement learning

S Devlin, L Yliniemi, D Kudenko… - Proceedings of the 2014 …, 2014 - aamas.csc.liv.ac.uk

Difference rewards and potential-based reward shaping can both significantly improve the
joint policy learnt by multiple reinforcement learning agents acting simultaneously in the …

被引用次数：263 相关文章所有 9 个版本

[PDF] github.io

Prediction-based multi-agent reinforcement learning in inherently non-stationary environments

A Marinescu, I Dusparic, S Clarke - ACM Transactions on Autonomous …, 2017 - dl.acm.org

Multi-agent reinforcement learning (MARL) is a widely researched technique for
decentralised control in complex large-scale autonomous systems. Such systems often …

被引用次数：69 相关文章所有 7 个版本

[PDF] vub.ac.be

[PDF][PDF] Comparative evaluation of cooperative multi-agent deep reinforcement learning algorithms

G Papoudakis, F Christianos, L Schäfer… - arXiv preprint arXiv …, 2020 - ala2021.vub.ac.be

Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used
evaluation tasks and criteria, making comparisons between approaches difficult. In this work …

被引用次数：56 相关文章所有 2 个版本

Deep reinforcement learning

M Krichen - 2023 14th International Conference on Computing …, 2023 - ieeexplore.ieee.org

Deep Reinforcement Learning (DRL) is a powerful technique for learning policies for
complex decision-making tasks. In this paper, we provide an overview of DRL, including its …

被引用次数：14 相关文章

[PDF] springer.com

Deep multiagent reinforcement learning: Challenges and directions

A Wong, T Bäck, AV Kononova, A Plaat - Artificial Intelligence Review, 2023 - Springer

This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …

被引用次数：86 相关文章所有 8 个版本

高级搜索

QQ 群