查看文章

ieee.org 中的 [PDF]

Accelerating model-free reinforcement learning with imperfect model knowledge in dynamic spectrum access

作者

Lianjun Li, Lingjia Liu, Jianan Bai, Hao-Hsuan Chang, Hao Chen, Jonathan D Ashdown, Jianzhong Zhang, Yang Yi

发表日期

2020/4/16

期刊

IEEE Internet of Things Journal

卷号

期号

页码范围

7517-7528

出版商

IEEE

简介

Current studies that apply reinforcement learning (RL) to dynamic spectrum access (DSA) problems in wireless communications systems mainly focus on model-free RL (MFRL). However, in practice, MFRL requires a large number of samples to achieve good performance making it impractical in real-time applications such as DSA. Combining model-free and model-based RL can potentially reduce the sample complexity while achieving a similar level of performance as MFRL as long as the learned model is accurate enough. However, in a complex environment, the learned model is never perfect. In this article, we combine model-free and model-based RL, and introduce an algorithm that can work with an imperfectly learned model to accelerate the MFRL. Results show our algorithm achieves higher sample efficiency than the standard MFRL algorithm and the Dyna algorithm (a standard algorithm integrating model …

引用总数

被引用次数：23

202020212022202320244 8 4 6 1

学术搜索中的文章

Accelerating model-free reinforcement learning with imperfect model knowledge in dynamic spectrum access

L Li, L Liu, J Bai, HH Chang, H Chen, JD Ashdown… - IEEE Internet of Things Journal, 2020

被引用次数：23 相关文章所有 3 个版本