查看文章

Partially observable multi-agent deep reinforcement learning for cognitive resource management

作者

Ning Yang, Haijun Zhang, Randall Berry

发表日期

2020/12/7

研讨会论文

GLOBECOM 2020-2020 IEEE Global Communications Conference

页码范围

1-6

出版商

IEEE

简介

In this paper, the problem of dynamic resource management in a cognitive radio network (CRN) with multiple primary users (PUs), multiple secondary users (SUs), and multiple channels is investigated. An optimization problem is formulated as a multi-agent partially observable Markov decision process (POMDP) problem in a dynamic and not fully observable environment. We consider using deep reinforcement learning (DRL) to address this problem. Based on the channel occupancy of PUs, a multi-agent deep Q-network (DQN)-based dynamic joint spectrum access and mode selection (SAMS) scheme is proposed for the SUs in the partially observable environment. The current observation of each SU is mapped to a suitable action. Each secondary user (SU) takes its own decision without exchanging information with other SUs. It seeks to maximize the total sum rate. Simulation results verify the effectiveness of our …

引用总数

被引用次数：22

2021202220233 8 11

学术搜索中的文章

Partially observable multi-agent deep reinforcement learning for cognitive resource management

N Yang, H Zhang, R Berry - GLOBECOM 2020-2020 IEEE Global Communications …, 2020

被引用次数：22 相关文章所有 3 个版本