相关文章- 学术资源搜索

Potential and pitfalls of multi-armed bandits for decentralized spatial reuse in WLANs

F Wilhelmi, S Barrachina-Munoz, B Bellalta… - Journal of Network and …, 2019 - Elsevier

Spatial Reuse (SR) has recently gained attention to maximize the performance of IEEE
802.11 Wireless Local Area Networks (WLANs). Decentralized mechanisms are expected to …

被引用次数：57 相关文章所有 5 个版本

[PDF] koreascience.kr

Applying deep reinforcement learning to improve throughput and reduce collision rate in IEEE 802.11 networks

CH Ke, L Astuti - KSII Transactions on Internet and Information …, 2022 - koreascience.kr

Abstract The effectiveness of Wi-Fi networks is greatly influenced by the optimization of
contention window (CW) parameters. Unfortunately, the conventional approach employed …

被引用次数：12 相关文章所有 7 个版本

[HTML] mdpi.com

[HTML][HTML] A survey on applications of reinforcement learning in flying ad-hoc networks

S Rezwan, W Choi - Electronics, 2021 - mdpi.com

Flying ad-hoc networks (FANET) are one of the most important branches of wireless ad-hoc
networks, consisting of multiple unmanned air vehicles (UAVs) performing assigned tasks …

被引用次数：45 相关文章所有 3 个版本

[PDF] maynoothuniversity.ie

Q-learning for cognitive radios

N Hosey, S Bergin, I Macaluso… - Proceedings of the …, 2009 - mural.maynoothuniversity.ie

Machine Learning approaches such as Reinforcement Learning (RL) can be used to solve
problems such as spectrum sensing and channel allocation in the cognitive radio domain …

被引用次数：25 相关文章所有 3 个版本

[PDF] arxiv.org

ns3-gym: Extending openai gym for networking research

P Gawłowicz, A Zubow - arXiv preprint arXiv:1810.03943, 2018 - arxiv.org

OpenAI Gym is a toolkit for reinforcement learning (RL) research. It includes a large number
of well-known problems that expose a common interface allowing to directly compare the …

被引用次数：71 相关文章所有 9 个版本

[PDF] liv.ac.uk

[PDF][PDF] Multi-objective reinforcement learning with non-linear scalarization

M Agarwal, V Aggarwal, T Lan - Proceedings of the 21st …, 2022 - aamas.csc.liv.ac.uk

In many real-world problems, an agent simultaneously optimizes multiple rewards [7, 35,
40]. Further, more often than not, the objectives can be conflicting. Typical examples for such …

被引用次数：22 相关文章所有 6 个版本

[PDF] mlr.press

A theoretical analysis of deep Q-learning

J Fan, Z Wang, Y Xie, Z Yang - Learning for dynamics and …, 2020 - proceedings.mlr.press

Despite the great empirical success of deep reinforcement learning, its theoretical
foundation is less well understood. In this work, we make the first attempt to theoretically …

被引用次数：705 相关文章所有 9 个版本

[PDF] researchgate.net

Q-learning based co-operative spectrum mobility in cognitive radio networks

A Das, SC Ghosh, N Das… - 2017 IEEE 42nd …, 2017 - ieeexplore.ieee.org

In cognitive radio systems, fast and efficient spectrum selection is a vital task to minimize the
overhead of spectrum scanning, and hence to improve the response time of the system. So …

被引用次数：30 相关文章所有 5 个版本

[PDF] tue.nl

Comparing exploration strategies for Q-learning in random stochastic mazes

AD Tijsma, MM Drugan… - 2016 IEEE symposium …, 2016 - ieeexplore.ieee.org

Balancing the ratio between exploration and exploitation is an important problem in
reinforcement learning. This paper evaluates four different exploration strategies combined …

被引用次数：163 相关文章所有 11 个版本

[PDF] mlr.press

S4rl: Surprisingly simple self-supervision for offline reinforcement learning in robotics

S Sinha, A Mandlekar, A Garg - Conference on Robot …, 2022 - proceedings.mlr.press

Offline reinforcement learning proposes to learn policies from large collected datasets
without interacting with the physical environment. These algorithms have made it possible to …

被引用次数：102 相关文章所有 5 个版本

高级搜索

QQ 群