Applications of multi-agent reinforcement learning in future internet: A comprehensive survey

T Li, K Zhu, NC Luong, D Niyato, Q Wu… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Future Internet involves several emerging technologies such as 5G and beyond 5G
networks, vehicular networks, unmanned aerial vehicle (UAV) networks, and Internet of …

Leveraging deep reinforcement learning for traffic engineering: A survey

Y Xiao, J Liu, J Wu, N Ansari - IEEE Communications Surveys & …, 2021 - ieeexplore.ieee.org
After decades of unprecedented development, modern networks have evolved far beyond
expectations in terms of scale and complexity. In many cases, traditional traffic engineering …

A metaverse-based teaching building evacuation training system with deep reinforcement learning

J Gu, J Wang, X Guo, G Liu, S Qin… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the development of IoT, virtual reality, cloud computing, and digital twin technologies,
the advent of metaverse has attracted increasing world attention. Metaverse integrates and …

Marl sim2real transfer: Merging physical reality with digital virtuality in metaverse

H Shi, G Liu, K Zhang, Z Zhou… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Metaverse is an artificial virtual world mapped from and interacting with the real world. In
metaverse, digital entities coexist with their physical counterparts. Powered by deep …

Hierarchical deep reinforcement learning with experience sharing for metaverse in education

R Hare, Y Tang - IEEE Transactions on Systems, Man, and …, 2022 - ieeexplore.ieee.org
Metaverse has gained increasing interest in education, with much of literature focusing on its
great potential to enhance both individual and social aspects of learning. However, little …

Dynamic job-shop scheduling problems using graph neural network and deep reinforcement learning

CL Liu, TH Huang - IEEE Transactions on Systems, Man, and …, 2023 - ieeexplore.ieee.org
The job-shop scheduling problem (JSSP) is one of the best-known combinatorial
optimization problems and is also an essential task in various sectors. In most real-world …

QR-SDN: Towards reinforcement learning states, actions, and rewards for direct flow routing in software-defined networks

J Rischke, P Sossalla, H Salah, FHP Fitzek… - IEEE …, 2020 - ieeexplore.ieee.org
Flow routing can achieve fine-grained network performance optimizations by routing distinct
packet traffic flows over different network paths. While the centralized control of Software …

Multiagent meta-reinforcement learning for adaptive multipath routing optimization

L Chen, B Hu, ZH Guan, L Zhao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this article, we investigate the routing problem of packet networks through multiagent
reinforcement learning (RL), which is a very challenging topic in distributed and autonomous …

Proximal policy optimization with policy feedback

Y Gu, Y Cheng, CLP Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Proximal policy optimization (PPO) is a deep reinforcement learning algorithm based on the
actor–critic (AC) architecture. In the classic AC architecture, the Critic (value) network is used …

Mean-field multiagent reinforcement learning: A decentralized network approach

H Gu, X Guo, X Wei, R Xu - Mathematics of Operations …, 2024 - pubsonline.informs.org
One of the challenges for multiagent reinforcement learning (MARL) is designing efficient
learning algorithms for a large system in which each agent has only limited or partial …