Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Reinforcement learning in robotic applications: a comprehensive survey

B Singh, R Kumar, VP Singh - Artificial Intelligence Review, 2022 - Springer
In recent trends, artificial intelligence (AI) is used for the creation of complex automated
control systems. Still, researchers are trying to make a completely autonomous system that …

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y Xie, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press
Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

Mean field multi-agent reinforcement learning

Y Yang, R Luo, M Li, M Zhou… - … on machine learning, 2018 - proceedings.mlr.press
Existing multi-agent reinforcement learning methods are limited typically to a small number
of agents. When the agent number increases largely, the learning becomes intractable due …

Reinforcement learning-based physical cross-layer security and privacy in 6G

X Lu, L Xiao, P Li, X Ji, C Xu, S Yu… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Sixth-generation (6G) cellular systems will have an inherent vulnerability to physical (PHY)-
layer attacks and privacy leakage, due to the large-scale heterogeneous networks with …

Deep reinforcement learning from self-play in imperfect-information games

J Heinrich, D Silver - arXiv preprint arXiv:1603.01121, 2016 - arxiv.org
Many real-world applications can be described as large-scale games of imperfect
information. To deal with these challenging domains, prior work has focused on computing …

Intelligent reflecting surface assisted anti-jamming communications: A fast reinforcement learning approach

H Yang, Z Xiong, J Zhao, D Niyato, Q Wu… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Malicious jamming launched by smart jammers can attack legitimate transmissions, which
has been regarded as one of the critical security challenges in wireless communications …

UAV relay in VANETs against smart jamming with reinforcement learning

L Xiao, X Lu, D Xu, Y Tang, L Wang… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Frequency hopping-based antijamming techniques are not always applicable in vehicular
ad hoc networks (VANETs) due to the high mobility of onboard units (OBUs) and the large …

Dealing with non-stationarity in multi-agent deep reinforcement learning

G Papoudakis, F Christianos, A Rahman… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent developments in deep reinforcement learning are concerned with creating decision-
making agents which can perform well in various complex domains. A particular approach …