相关文章- 学术资源搜索

ns3-gym: Extending openai gym for networking research

P Gawłowicz, A Zubow - arXiv preprint arXiv:1810.03943, 2018 - arxiv.org

OpenAI Gym is a toolkit for reinforcement learning (RL) research. It includes a large number
of well-known problems that expose a common interface allowing to directly compare the …

被引用次数：72 相关文章所有 9 个版本

[PDF] tue.nl

Comparing exploration strategies for Q-learning in random stochastic mazes

AD Tijsma, MM Drugan… - 2016 IEEE symposium …, 2016 - ieeexplore.ieee.org

Balancing the ratio between exploration and exploitation is an important problem in
reinforcement learning. This paper evaluates four different exploration strategies combined …

被引用次数：163 相关文章所有 11 个版本

[PDF] mlr.press

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y Xie, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press

Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

被引用次数：711 相关文章所有 9 个版本

[PDF] researchgate.net

Wireless data acquisition for edge learning: Importance-aware retransmission

D Liu, G Zhu, J Zhang, K Huang - 2019 IEEE 20th International …, 2019 - ieeexplore.ieee.org

By deploying machine learning algorithms at the network edge, edge learning recently
emerges as a promising framework to support intelligent mobile services. It effectively …

被引用次数：23 相关文章所有 4 个版本

[HTML] mdpi.com

[HTML][HTML] A survey on applications of reinforcement learning in flying ad-hoc networks

S Rezwan, W Choi - Electronics, 2021 - mdpi.com

Flying ad-hoc networks (FANET) are one of the most important branches of wireless ad-hoc
networks, consisting of multiple unmanned air vehicles (UAVs) performing assigned tasks …

被引用次数：45 相关文章所有 3 个版本

[PDF] mlr.press

Diagnosing bottlenecks in deep q-learning algorithms

J Fu, A Kumar, M Soh, S Levine - … Conference on Machine …, 2019 - proceedings.mlr.press

Q-learning methods are a common class of algorithms used in reinforcement learning (RL).
However, their behavior with function approximation, especially with neural networks, is …

被引用次数：151 相关文章所有 7 个版本

[PDF] researchgate.net

Q-learning based co-operative spectrum mobility in cognitive radio networks

A Das, SC Ghosh, N Das… - 2017 IEEE 42nd …, 2017 - ieeexplore.ieee.org

In cognitive radio systems, fast and efficient spectrum selection is a vital task to minimize the
overhead of spectrum scanning, and hence to improve the response time of the system. So …

被引用次数：30 相关文章所有 5 个版本

Stochastic power adaptation with multiagent reinforcement learning for cognitive wireless mesh networks

X Chen, Z Zhao, H Zhang - IEEE transactions on mobile …, 2012 - ieeexplore.ieee.org

As the scarce spectrum resource is becoming overcrowded, cognitive radio indicates great
flexibility to improve the spectrum efficiency by opportunistically accessing the authorized …

被引用次数：85 相关文章所有 6 个版本

[PDF] eudl.eu

Learning from experts in cognitive radio networks: The docitive paradigm

A Galindo-Serrano, L Giupponi… - 2010 Proceedings of …, 2010 - ieeexplore.ieee.org

In this paper we introduce the novel paradigm of docition for cognitive radio (CR) networks.
We consider that the CRs are intelligent radios implementing a learning process through …

被引用次数：33 相关文章所有 5 个版本

[PDF] aaai.org

Deep q-learning from demonstrations

T Hester, M Vecerik, O Pietquin, M Lanctot… - Proceedings of the …, 2018 - ojs.aaai.org

Deep reinforcement learning (RL) has achieved several high profile successes in difficult
decision-making problems. However, these algorithms typically require a huge amount of …

被引用次数：1213 相关文章所有 12 个版本

高级搜索

QQ 群