Stateless reinforcement learning for multi-agent systems: The case of spectrum allocation in dynamic channel bonding WLANs

S Barrachina-Muñoz, A Chiumento… - 2021 Wireless Days …, 2021 - ieeexplore.ieee.org
Spectrum allocation in the form of primary channel and bandwidth selection is a key factor
for dynamic channel bonding (DCB) wireless local area networks (WLANs). To cope with …

Applications of deep reinforcement learning in communications and networking: A survey

NC Luong, DT Hoang, S Gong, D Niyato… - … surveys & tutorials, 2019 - ieeexplore.ieee.org
This paper presents a comprehensive literature review on applications of deep
reinforcement learning (DRL) in communications and networking. Modern networks, eg …

Distributed learning in wireless networks: Recent progress and future challenges

M Chen, D Gündüz, K Huang, W Saad… - IEEE Journal on …, 2021 - ieeexplore.ieee.org
The next-generation of wireless networks will enable many machine learning (ML) tools and
applications to efficiently analyze various types of data collected by edge devices for …

Cooperative Q‐learning techniques for distributed online power allocation in femtocell networks

H Saad, A Mohamed, T ElBatt - Wireless Communications and …, 2015 - Wiley Online Library
In this paper, we address the problem of distributed interference management of femtocells
that share the same frequency band with macrocells using distributed multi‐agent Q …

Distributed channel allocation for mobile 6G subnetworks via multi-agent deep Q-learning

R Adeogun, G Berardinelli - 2023 IEEE Wireless …, 2023 - ieeexplore.ieee.org
Sixth generation (6G) in-X subnetworks are recently proposed as short-range low-power
radio cells for supporting localized extreme wireless connectivity inside entities such as …

Enhanced off-policy reinforcement learning with focused experience replay

SH Kong, IMA Nahrendra, DH Paek - IEEE Access, 2021 - ieeexplore.ieee.org
Utilizing the collected experience tuples in the replay buffer (RB) is the primary way of
exploiting the experiences in the off-policy reinforcement learning (RL) algorithms, and …

Distributionally Robust -Learning

Z Liu, Q Bai, J Blanchet, P Dong, W Xu… - International …, 2022 - proceedings.mlr.press
Reinforcement learning (RL) has demonstrated remarkable achievements in simulated
environments. However, carrying this success to real environments requires the important …

Scalable and sample efficient distributed policy gradient algorithms in multi-agent networked systems

X Liu, H Wei, L Ying - arXiv preprint arXiv:2212.06357, 2022 - arxiv.org
This paper studies a class of multi-agent reinforcement learning (MARL) problems where the
reward that an agent receives depends on the states of other agents, but the next state only …

Bail: Best-action imitation learning for batch deep reinforcement learning

X Chen, Z Zhou, Z Wang, C Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc
There has recently been a surge in research in batch Deep Reinforcement Learning (DRL),
which aims for learning a high-performing policy from a given dataset without additional …

Decentralized spectrum learning and access adaptive to channel availability distribution in primary network

M Zandi, M Dong, A Grami - 2013 IEEE 14th Workshop on …, 2013 - ieeexplore.ieee.org
We consider the effect of the mean availability distribution of primary channels on the
performance of distributed learning and access policies, and develop a distributed learning …