ns3-gym: Extending openai gym for networking research

P Gawłowicz, A Zubow - arXiv preprint arXiv:1810.03943, 2018 - arxiv.org
OpenAI Gym is a toolkit for reinforcement learning (RL) research. It includes a large number
of well-known problems that expose a common interface allowing to directly compare the …

Comparing exploration strategies for Q-learning in random stochastic mazes

AD Tijsma, MM Drugan… - 2016 IEEE symposium …, 2016 - ieeexplore.ieee.org
Balancing the ratio between exploration and exploitation is an important problem in
reinforcement learning. This paper evaluates four different exploration strategies combined …

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y Xie, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press
Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

Wireless data acquisition for edge learning: Importance-aware retransmission

D Liu, G Zhu, J Zhang, K Huang - 2019 IEEE 20th International …, 2019 - ieeexplore.ieee.org
By deploying machine learning algorithms at the network edge, edge learning recently
emerges as a promising framework to support intelligent mobile services. It effectively …

[HTML][HTML] A survey on applications of reinforcement learning in flying ad-hoc networks

S Rezwan, W Choi - Electronics, 2021 - mdpi.com
Flying ad-hoc networks (FANET) are one of the most important branches of wireless ad-hoc
networks, consisting of multiple unmanned air vehicles (UAVs) performing assigned tasks …

Diagnosing bottlenecks in deep q-learning algorithms

J Fu, A Kumar, M Soh, S Levine - … Conference on Machine …, 2019 - proceedings.mlr.press
Q-learning methods are a common class of algorithms used in reinforcement learning (RL).
However, their behavior with function approximation, especially with neural networks, is …

Q-learning based co-operative spectrum mobility in cognitive radio networks

A Das, SC Ghosh, N Das… - 2017 IEEE 42nd …, 2017 - ieeexplore.ieee.org
In cognitive radio systems, fast and efficient spectrum selection is a vital task to minimize the
overhead of spectrum scanning, and hence to improve the response time of the system. So …

Stochastic power adaptation with multiagent reinforcement learning for cognitive wireless mesh networks

X Chen, Z Zhao, H Zhang - IEEE transactions on mobile …, 2012 - ieeexplore.ieee.org
As the scarce spectrum resource is becoming overcrowded, cognitive radio indicates great
flexibility to improve the spectrum efficiency by opportunistically accessing the authorized …

Learning from experts in cognitive radio networks: The docitive paradigm

A Galindo-Serrano, L Giupponi… - 2010 Proceedings of …, 2010 - ieeexplore.ieee.org
In this paper we introduce the novel paradigm of docition for cognitive radio (CR) networks.
We consider that the CRs are intelligent radios implementing a learning process through …

Deep q-learning from demonstrations

T Hester, M Vecerik, O Pietquin, M Lanctot… - Proceedings of the …, 2018 - ojs.aaai.org
Deep reinforcement learning (RL) has achieved several high profile successes in difficult
decision-making problems. However, these algorithms typically require a huge amount of …