Deep reinforcement learning-based distributed dynamic spectrum access in multi-user multi-channel cognitive radio internet of things networks

X Zhang, Z Chen, Y Zhang, Y Liu, M Jin, T Qiu
IEEE Internet of Things Journal, 2024 (ieeexplore.ieee.org)
Integrating cognitive radio into the Internet of Things (IoT) helps alleviate spectrum scarcity in large-scale IoT deployments, where a core technology is the design of spectrum access algorithms that assign spectrum holes effectively. However, because channels are only partially observable and the number of users in a Cognitive Radio IoT (CRIoT) network keeps growing, secondary users have difficulty avoiding interference and accessing the spectrum quickly. This study presents a distributed dynamic spectrum access (DSA) algorithm that employs a priority experience replay deep echo state Q-network (PER-DESQN) for CRIoT networks with multiple users and channels. To accelerate Q-network convergence, an echo state network exploits the underlying temporal correlation to estimate Q-values. Then, to resolve Q-value overestimation and improve prediction accuracy, the Q-value estimation and the action-decision process are trained using a double deep Q-network (DDQN). Moreover, a priority experience replay mechanism that combines a Sum-Tree with importance sampling weights is proposed to optimize the DDQN and address the Q-value instability caused by random sampling. Simulation results demonstrate that the proposed algorithm makes fast and accurate DSA decisions and significantly boosts the network channel capacity.
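The two generic building blocks named in the abstract, Sum-Tree priority replay with importance sampling weights and the double-DQN target, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `SumTree`, `priority_from_td_error`, `sample_batch`, and `ddqn_target`, and the hyperparameters `alpha`, `beta`, `eps`, and `gamma`, are hypothetical and follow the commonly used proportional-prioritization formulation. The echo state network that estimates the Q-values is not shown.

```python
import numpy as np


class SumTree:
    """Array-backed binary tree: leaves hold priorities, internal nodes hold sums."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.tree = np.zeros(2 * capacity - 1)  # internal nodes first, then leaves
        self.data = [None] * capacity           # stored transitions
        self.write = 0                          # next slot to overwrite
        self.size = 0                           # number of stored transitions

    def total(self):
        return self.tree[0]

    def update(self, leaf, priority):
        # Set a leaf priority and propagate the change up to the root.
        change = priority - self.tree[leaf]
        self.tree[leaf] = priority
        while leaf != 0:
            leaf = (leaf - 1) // 2
            self.tree[leaf] += change

    def add(self, priority, item):
        leaf = self.write + self.capacity - 1
        self.data[self.write] = item
        self.update(leaf, priority)
        self.write = (self.write + 1) % self.capacity
        self.size = min(self.size + 1, self.capacity)

    def get(self, s):
        # Walk down from the root to the leaf whose cumulative range contains s.
        idx = 0
        while idx < self.capacity - 1:
            left = 2 * idx + 1
            if s <= self.tree[left]:
                idx = left
            else:
                s -= self.tree[left]
                idx = left + 1
        return idx, self.tree[idx], self.data[idx - self.capacity + 1]


def priority_from_td_error(td_error, alpha=0.6, eps=1e-3):
    # Assumed proportional prioritization: p = (|delta| + eps) ** alpha.
    return (abs(td_error) + eps) ** alpha


def sample_batch(tree, batch_size, beta, rng):
    # Stratified sampling: one draw per equal-width segment of the total priority.
    segment = tree.total() / batch_size
    leaves, items, probs = [], [], []
    for k in range(batch_size):
        s = rng.uniform(segment * k, segment * (k + 1))
        leaf, priority, item = tree.get(s)
        leaves.append(leaf)
        items.append(item)
        probs.append(priority / tree.total())
    # Importance sampling weights correct the bias of non-uniform sampling.
    weights = (tree.size * np.array(probs)) ** (-beta)
    weights /= weights.max()  # normalize so the largest weight is 1
    return leaves, items, weights


def ddqn_target(reward, q_online_next, q_target_next, gamma, done):
    # Double DQN: the online network selects the action,
    # the target network evaluates it.
    best = np.argmax(q_online_next, axis=1)
    q_eval = q_target_next[np.arange(len(best)), best]
    return reward + gamma * (1.0 - done) * q_eval
```

In a training loop built on this sketch, each sampled transition's loss would be scaled by its importance sampling weight, and its leaf priority would be refreshed with `tree.update(leaf, priority_from_td_error(td_error))` after the new TD error is computed.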