查看文章

arxiv.org 中的 [PDF]

Deep Contextual Bandit and Reinforcement Learning for IRS-assisted MU-MIMO Systems

作者

Dariel Pereira-Ruisánchez, Óscar Fresnedo, Darian Pérez-Adán, Luis Castedo

发表日期

2023/2

期刊

IEEE Transactions on Vehicular Technology

卷号

期号

页码范围

9099 - 9114

出版商

IEEE

简介

The combination of multiple-input multiple-output (MIMO) systems and intelligent reflecting surfaces (IRSs) is foreseen as a critical enabler of beyond 5G (B5G) and 6G. In this work, two different approaches are considered for the joint optimization of the IRS phase-shift matrix and MIMO precoders of an IRS-assisted multi-stream (MS) multi-user MIMO (MU-MIMO) system. Both approaches aim to maximize the system sum-rate for every channel realization. The first proposed solution is a novel contextual bandit (CB) framework with continuous state and action spaces called deep contextual bandit-oriented deep deterministic policy gradient (DCB-DDPG). The second is an innovative deep reinforcement learning (DRL) formulation where the states, actions, and rewards are selected such that the Markov decision process (MDP) property of reinforcement learning (RL) is appropriately met. Both proposals perform …

引用总数

被引用次数：12

2022202320241 7 4

学术搜索中的文章

Deep contextual bandit and reinforcement learning for IRS-assisted MU-MIMO systems

D Pereira-Ruisánchez, Ó Fresnedo, D Pérez-Adán… - IEEE Transactions on Vehicular Technology, 2023

被引用次数：12 相关文章所有 6 个版本