Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

[图书][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer
This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

Cooperative inverse reinforcement learning

D Hadfield-Menell, SJ Russell… - Advances in neural …, 2016 - proceedings.neurips.cc
For an autonomous system to be helpful to humans and to pose no unwarranted risks, it
needs to align its values with those of the humans in its environment in such a way that its …

Combining deep reinforcement learning and search for imperfect-information games

N Brown, A Bakhtin, A Lerer… - Advances in Neural …, 2020 - proceedings.neurips.cc
The combination of deep reinforcement learning and search at both training and test time is
a powerful paradigm that has led to a number of successes in single-agent settings and …

On improving model-free algorithms for decentralized multi-agent reinforcement learning

W Mao, L Yang, K Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press
Multi-agent reinforcement learning (MARL) algorithms often suffer from an exponential
sample complexity dependence on the number of agents, a phenomenon known as the …

System level synthesis

J Anderson, JC Doyle, SH Low, N Matni - Annual Reviews in Control, 2019 - Elsevier
This article surveys the System Level Synthesis framework, which presents a novel
perspective on constrained robust and optimal controller synthesis for linear systems. We …

Bayesian action decoder for deep multi-agent reinforcement learning

J Foerster, F Song, E Hughes, N Burch… - International …, 2019 - proceedings.mlr.press
When observing the actions of others, humans make inferences about why they acted as
they did, and what this implies about the world; humans also use the fact that their actions …

Provably efficient reinforcement learning in decentralized general-sum markov games

W Mao, T Başar - Dynamic Games and Applications, 2023 - Springer
This paper addresses the problem of learning an equilibrium efficiently in general-sum
Markov games through decentralized multi-agent reinforcement learning. Given the …