A survey of reinforcement learning algorithms for dynamically varying environments

R Poiani, A Tirinzoni, M Restelli - arXiv preprint arXiv:2105.08834, 2021 - arxiv.org

Many real-world domains are subject to a structured non-stationarity which affects the
agent's goals and the environmental dynamics. Meta-reinforcement learning (RL) has been …

被引用次数：9 相关文章所有 7 个版本

[HTML] peerj.com

[HTML][HTML] Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control

LN Alegre, ALC Bazzan, BC da Silva - PeerJ Computer Science, 2021 - peerj.com

In reinforcement learning (RL), dealing with non-stationarity is a challenging issue.
However, some domains such as traffic optimization are inherently non-stationary. Causes …

被引用次数：12 相关文章所有 14 个版本

Integrating reinforcement learning with deterministic learning for fault diagnosis of nonlinear systems

Z Zhu, W Wu, T Chen, J Hu, C Wang - Neurocomputing, 2023 - Elsevier

Reliable fault diagnosis (FD) is important to ensure safety in nonlinear engineering systems.
Modern engineering systems are often subject to unknown complex nonlinearities and …

被引用次数：1 相关文章所有 2 个版本

[PDF] mdpi.com

Wineinformatics: can wine reviews in bordeaux reveal wine aging capability?

W Kwabla, F Coulibaly, Y Zhenis, B Chen - Fermentation, 2021 - mdpi.com

Wineinformatics is a new and emerging data science that uses wine as domain knowledge
and integrates data systems and wine-related data sets. Wine reviews from Wine Spectator …

被引用次数：6 相关文章所有 9 个版本

Hierarchical Reinforcement Learning-Based Routing Algorithm With Grouped RSU in Urban VANETs

Q Yang, SJ Yoo - IEEE Transactions on Intelligent …, 2024 - ieeexplore.ieee.org

The rapid growth of the Internet of Vehicles (IoV) has generated significant interest in routing
techniques for vehicular ad hoc networks (VANETs) in both academic and industrial …

Application of Reinforcement Learning to UR10 Positioning for Prioritized Multi-Step Inspection in NVIDIA Omniverse

F Bahrpeyma, A Sunilkumar… - 2023 IEEE Symposium …, 2023 - ieeexplore.ieee.org

Sequential multi-step operations have long played an important role in manufacturing
systems. In high level multi-step manufacturing processes, multiple operations are carried …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Online reinforcement learning with sparse rewards through an active inference capsule

AD Noel, C Van Hoof, B Millidge - arXiv preprint arXiv:2106.02390, 2021 - arxiv.org

Intelligent agents must pursue their goals in complex environments with partial information
and often limited computational capacity. Reinforcement learning methods have achieved …

被引用次数：7 相关文章所有 3 个版本

[PDF] arxiv.org

Optimizing Heat Alert Issuance for Public Health in the United States with Reinforcement Learning

EM Considine, RC Nethery, GA Wellenius… - arXiv preprint arXiv …, 2023 - arxiv.org

Alerting the public when heat may harm their health is a crucial service, especially
considering that extreme heat events will be more frequent under climate change. Current …

DDoS and flash event detection in higher bandwidth SDN-IoT using multiagent reinforcement learning

DK Dake, JD Gadze, GS Klogo - … International Conference on …, 2021 - ieeexplore.ieee.org

The emergence of 5G, IoT, Big Data, and related technologies have necessitated a shift to
SDN architectural design and DRL algorithms for network task automation. Without prompt …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Q-FOX Learning: Breaking Tradition in Reinforcement Learning

M Alqaseer, YH Ali, TA Rashid - arXiv preprint arXiv:2402.16562, 2024 - arxiv.org

Reinforcement learning (RL) is a subset of artificial intelligence (AI) where agents learn the
best action by interacting with the environment, making it suitable for tasks that do not …

被引用次数：1 相关文章所有 3 个版本

高级搜索

QQ 群