Maximize the long-term average revenue of network slice provider via admission control among heterogeneous slices

M Dai, G Sun, H Yu, D Niyato - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
Network slicing endows 5G/B5G with differentiated and customized capabilities to cope with
the proliferation of diversified services, whereas limited physical network resources may not …

Multi-agent reinforcement learning with policy clipping and average evaluation for UAV-assisted communication Markov game

Z Feng, M Huang, D Wu, EQ Wu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Unmanned aerial vehicle (UAV)-assisted communication is a significant technology in 6G
communication. In order to cope with the dynamic trajectory optimization problem of the air …

Approximating Nash equilibrium for anti-UAV jamming Markov game using a novel event-triggered multi-agent reinforcement learning

Z Feng, M Huang, Y Wu, D Wu, J Cao, I Korovin… - Neural Networks, 2023 - Elsevier
In the downlink communication, it is currently challenging for ground users to cope with the
uncertain interference from aerial intelligent jammers. The cooperation and competition …

Modeling motorcyclist–pedestrian near misses: A multiagent adversarial inverse reinforcement learning approach

G Lanzaro, T Sayed, R Alsaleh - Journal of Computing in Civil …, 2022 - ascelibrary.org
Several studies have used surrogate safety measures obtained from microsimulation
packages, such as VISSIM, for safety assessments. However, this approach has …

Multi-agent Reinforcement Learning Clustering Algorithm Based on Silhouette Coefficient

P Du, F Li, J Shao - Neurocomputing, 2024 - Elsevier
As an important branch of emerging artificial intelligence algorithms, multi-agent
reinforcement learning (MARL) has shown strong performance in collaborative …

A distributed reinforcement learning approach for power control in wireless networks

A Ornatelli, A Tortorelli, F Liberati - 2021 IEEE World AI IoT …, 2021 - ieeexplore.ieee.org
This paper tackles the power control problem in the context of wireless networks. The
development of intelligent services based on widespread smart devices with limited energy …

基于局部位置感知的多智能体网约车调度方法.

黄晓辉, 凌嘉壕, 张雄, 熊李艳… - Journal of Computer …, 2023 - search.ebscohost.com
近年来, 网上约车成为人们日常出行不可或缺的一部分. 网约车平台的核心任务是如何有效地把
订单派送给合适的司机, 使得用户总体等待时间尽可能短, 而司机的收益尽可能高 …

Strategy Determination for Multiple USVs: A Min-max Q-learning Approach

L Hong, W Cui - International Conference on Neural Computing for …, 2023 - Springer
Abstract The application of Unmanned Surface Vehicles (USVs) has gained significant
momentum in various domains, including surveillance operations and security enforcement …

Determining the Equilibrium Solution in Two-Player Dynamic Discrete Markovian Games with transition probabilities influenced by competitor strategies

R Sadeghian - Soft Computing Journal, 2022 - scj.kashanu.ac.ir
This paper focuses on a specific type of game called Markovian dynamic game. In these
games, the strategy of each player is considered as a state of a Markov chain. At each stage …

Approximating Stackelberg Equilibrium in Anti-UAV Jamming Markov Game with Hierarchical Multi-Agent Deep Reinforcement Learning Algorithm

Z Feng, Y Wu, M Huang, D Wu - 2021 - researchsquare.com
In order to avoid the malicious jamming of the intelligent unmanned aerial vehicle (UAV) to
ground users in the downlink communications, a new anti-UAV jamming strategy based on …