- 学术资源搜索

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

被引用次数：220 相关文章所有 3 个版本

[PDF] arxiv.org

Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1700 相关文章所有 8 个版本

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

被引用次数：350 相关文章所有 2 个版本

[PDF] fransoliehoek.net

[图书][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer

This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

被引用次数：1377 相关文章所有 13 个版本

[PDF] neurips.cc

Cooperative inverse reinforcement learning

D Hadfield-Menell, SJ Russell… - Advances in neural …, 2016 - proceedings.neurips.cc

For an autonomous system to be helpful to humans and to pose no unwarranted risks, it
needs to align its values with those of the humans in its environment in such a way that its …

被引用次数：840 相关文章所有 19 个版本

[PDF] neurips.cc

Combining deep reinforcement learning and search for imperfect-information games

N Brown, A Bakhtin, A Lerer… - Advances in Neural …, 2020 - proceedings.neurips.cc

The combination of deep reinforcement learning and search at both training and test time is
a powerful paradigm that has led to a number of successes in single-agent settings and …

被引用次数：170 相关文章所有 10 个版本

[PDF] mlr.press

On improving model-free algorithms for decentralized multi-agent reinforcement learning

W Mao, L Yang, K Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press

Multi-agent reinforcement learning (MARL) algorithms often suffer from an exponential
sample complexity dependence on the number of agents, a phenomenon known as the …

被引用次数：69 相关文章所有 5 个版本

[PDF] sciencedirect.com

System level synthesis

J Anderson, JC Doyle, SH Low, N Matni - Annual Reviews in Control, 2019 - Elsevier

This article surveys the System Level Synthesis framework, which presents a novel
perspective on constrained robust and optimal controller synthesis for linear systems. We …

被引用次数：207 相关文章所有 8 个版本

[PDF] mlr.press

Bayesian action decoder for deep multi-agent reinforcement learning

J Foerster, F Song, E Hughes, N Burch… - International …, 2019 - proceedings.mlr.press

When observing the actions of others, humans make inferences about why they acted as
they did, and what this implies about the world; humans also use the fact that their actions …

被引用次数：190 相关文章所有 6 个版本

[PDF] arxiv.org

Provably efficient reinforcement learning in decentralized general-sum markov games

W Mao, T Başar - Dynamic Games and Applications, 2023 - Springer

This paper addresses the problem of learning an equilibrium efficiently in general-sum
Markov games through decentralized multi-agent reinforcement learning. Given the …

被引用次数：85 相关文章所有 6 个版本

高级搜索

QQ 群

Ai alignment: A comprehensive survey

Multi-agent reinforcement learning: A selective overview of theories and algorithms

An overview of multi-agent reinforcement learning from game theoretical perspective

[图书][B] A concise introduction to decentralized POMDPs

Cooperative inverse reinforcement learning

Combining deep reinforcement learning and search for imperfect-information games

On improving model-free algorithms for decentralized multi-agent reinforcement learning

System level synthesis

Bayesian action decoder for deep multi-agent reinforcement learning

Provably efficient reinforcement learning in decentralized general-sum markov games

引用