Potential and pitfalls of multi-armed bandits for decentralized spatial reuse in WLANs

F Wilhelmi, S Barrachina-Munoz, B Bellalta… - Journal of Network and …, 2019 - Elsevier
Spatial Reuse (SR) has recently gained attention to maximize the performance of IEEE
802.11 Wireless Local Area Networks (WLANs). Decentralized mechanisms are expected to …

Applying deep reinforcement learning to improve throughput and reduce collision rate in IEEE 802.11 networks

CH Ke, L Astuti - KSII Transactions on Internet and Information …, 2022 - koreascience.kr
Abstract The effectiveness of Wi-Fi networks is greatly influenced by the optimization of
contention window (CW) parameters. Unfortunately, the conventional approach employed …

[HTML][HTML] A survey on applications of reinforcement learning in flying ad-hoc networks

S Rezwan, W Choi - Electronics, 2021 - mdpi.com
Flying ad-hoc networks (FANET) are one of the most important branches of wireless ad-hoc
networks, consisting of multiple unmanned air vehicles (UAVs) performing assigned tasks …

Q-learning for cognitive radios

N Hosey, S Bergin, I Macaluso… - Proceedings of the …, 2009 - mural.maynoothuniversity.ie
Machine Learning approaches such as Reinforcement Learning (RL) can be used to solve
problems such as spectrum sensing and channel allocation in the cognitive radio domain …

ns3-gym: Extending openai gym for networking research

P Gawłowicz, A Zubow - arXiv preprint arXiv:1810.03943, 2018 - arxiv.org
OpenAI Gym is a toolkit for reinforcement learning (RL) research. It includes a large number
of well-known problems that expose a common interface allowing to directly compare the …

[PDF][PDF] Multi-objective reinforcement learning with non-linear scalarization

M Agarwal, V Aggarwal, T Lan - Proceedings of the 21st …, 2022 - aamas.csc.liv.ac.uk
In many real-world problems, an agent simultaneously optimizes multiple rewards [7, 35,
40]. Further, more often than not, the objectives can be conflicting. Typical examples for such …

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y Xie, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press
Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

Q-learning based co-operative spectrum mobility in cognitive radio networks

A Das, SC Ghosh, N Das… - 2017 IEEE 42nd …, 2017 - ieeexplore.ieee.org
In cognitive radio systems, fast and efficient spectrum selection is a vital task to minimize the
overhead of spectrum scanning, and hence to improve the response time of the system. So …

Comparing exploration strategies for Q-learning in random stochastic mazes

AD Tijsma, MM Drugan… - 2016 IEEE symposium …, 2016 - ieeexplore.ieee.org
Balancing the ratio between exploration and exploitation is an important problem in
reinforcement learning. This paper evaluates four different exploration strategies combined …

S4rl: Surprisingly simple self-supervision for offline reinforcement learning in robotics

S Sinha, A Mandlekar, A Garg - Conference on Robot …, 2022 - proceedings.mlr.press
Offline reinforcement learning proposes to learn policies from large collected datasets
without interacting with the physical environment. These algorithms have made it possible to …