相关文章- 学术资源搜索

Stateless reinforcement learning for multi-agent systems: The case of spectrum allocation in dynamic channel bonding WLANs

S Barrachina-Muñoz, A Chiumento… - 2021 Wireless Days …, 2021 - ieeexplore.ieee.org

Spectrum allocation in the form of primary channel and bandwidth selection is a key factor
for dynamic channel bonding (DCB) wireless local area networks (WLANs). To cope with …

被引用次数：11 相关文章所有 6 个版本

[PDF] arxiv.org

Applications of deep reinforcement learning in communications and networking: A survey

NC Luong, DT Hoang, S Gong, D Niyato… - … surveys & tutorials, 2019 - ieeexplore.ieee.org

This paper presents a comprehensive literature review on applications of deep
reinforcement learning (DRL) in communications and networking. Modern networks, eg …

被引用次数：1718 相关文章所有 7 个版本

[PDF] ieee.org

Distributed learning in wireless networks: Recent progress and future challenges

M Chen, D Gündüz, K Huang, W Saad… - IEEE Journal on …, 2021 - ieeexplore.ieee.org

The next-generation of wireless networks will enable many machine learning (ML) tools and
applications to efficiently analyze various types of data collected by edge devices for …

被引用次数：383 相关文章所有 14 个版本

[PDF] academia.edu

Cooperative Q‐learning techniques for distributed online power allocation in femtocell networks

H Saad, A Mohamed, T ElBatt - Wireless Communications and …, 2015 - Wiley Online Library

In this paper, we address the problem of distributed interference management of femtocells
that share the same frequency band with macrocells using distributed multi‐agent Q …

被引用次数：15 相关文章所有 7 个版本

[PDF] aau.dk

Distributed channel allocation for mobile 6G subnetworks via multi-agent deep Q-learning

R Adeogun, G Berardinelli - 2023 IEEE Wireless …, 2023 - ieeexplore.ieee.org

Sixth generation (6G) in-X subnetworks are recently proposed as short-range low-power
radio cells for supporting localized extreme wireless connectivity inside entities such as …

被引用次数：5 相关文章所有 3 个版本

[PDF] ieee.org

Enhanced off-policy reinforcement learning with focused experience replay

SH Kong, IMA Nahrendra, DH Paek - IEEE Access, 2021 - ieeexplore.ieee.org

Utilizing the collected experience tuples in the replay buffer (RB) is the primary way of
exploiting the experiences in the off-policy reinforcement learning (RL) algorithms, and …

被引用次数：9 相关文章所有 5 个版本

[PDF] mlr.press

Distributionally Robust -Learning

Z Liu, Q Bai, J Blanchet, P Dong, W Xu… - International …, 2022 - proceedings.mlr.press

Reinforcement learning (RL) has demonstrated remarkable achievements in simulated
environments. However, carrying this success to real environments requires the important …

被引用次数：39 相关文章所有 3 个版本

[PDF] arxiv.org

Scalable and sample efficient distributed policy gradient algorithms in multi-agent networked systems

X Liu, H Wei, L Ying - arXiv preprint arXiv:2212.06357, 2022 - arxiv.org

This paper studies a class of multi-agent reinforcement learning (MARL) problems where the
reward that an agent receives depends on the states of other agents, but the next state only …

被引用次数：6 相关文章所有 2 个版本

[PDF] neurips.cc

Bail: Best-action imitation learning for batch deep reinforcement learning

X Chen, Z Zhou, Z Wang, C Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc

There has recently been a surge in research in batch Deep Reinforcement Learning (DRL),
which aims for learning a high-performing policy from a given dataset without additional …

被引用次数：119 相关文章所有 8 个版本

Decentralized spectrum learning and access adaptive to channel availability distribution in primary network

M Zandi, M Dong, A Grami - 2013 IEEE 14th Workshop on …, 2013 - ieeexplore.ieee.org

We consider the effect of the mean availability distribution of primary channels on the
performance of distributed learning and access policies, and develop a distributed learning …

被引用次数：16 相关文章所有 2 个版本

高级搜索

QQ 群