Reinforcement learning in practice: Opportunities and challenges

Bandit algorithms: A comprehensive review and their dynamic selection from a portfolio for multicriteria top-k recommendation

A Letard, N Gutowski, O Camp, T Amghar - Expert Systems with …, 2024 - Elsevier

This paper discusses the use of portfolio approaches based on bandit algorithms to optimize
multicriteria decision-making in recommender systems (accuracy and diversity). While …

Cited by 1 · Related articles · All 2 versions

Evolutionary Reinforcement Learning: A Systematic Review and Future Directions

Y Lin, F Lin, G Cai, H Chen, L Zou, P Wu - arXiv preprint arXiv:2402.13296, 2024 - arxiv.org

In response to the limitations of reinforcement learning and evolutionary algorithms (EAs) in
complex problem-solving, Evolutionary Reinforcement Learning (EvoRL) has emerged as a …

Cited by 1 · Related articles · All 2 versions

Improving proximal policy optimization with alpha divergence

H Xu, Z Yan, J Xuan, G Zhang, J Lu - Neurocomputing, 2023 - Elsevier

Proximal policy optimization (PPO) is a recent advancement in reinforcement learning,
which is formulated as an unconstrained optimization problem including two terms …

Cited by 5 · Related articles · All 3 versions

An on-site-based opportunistic routing protocol for scalable and energy-efficient underwater acoustic sensor networks

R Zhu, X Huang, X Huang, D Li, Q Yang - Applied Sciences, 2022 - mdpi.com

With the advancements in wireless sensor networks and the Internet of Underwater Things
(IoUT), underwater acoustic sensor networks (UASNs) have attracted much attention, which …

Cited by 7 · Related articles · All 7 versions

Applications of Reinforcement Learning in Finance--Trading with a Double Deep Q-Network

F Zejnullahu, M Moser, J Osterrieder - arXiv preprint arXiv:2206.14267, 2022 - arxiv.org

This paper presents a Double Deep Q-Network algorithm for trading single assets, namely
the E-mini S&P 500 continuous futures contract. We use a proven setup as the foundation for …

Cited by 5 · Related articles · All 11 versions

Cloud Elasticity of Microservices-based Applications: A Survey

MH Fourati, S Marzouk, M Jmaiel - 2024 - researchsquare.com

Elasticity is an essential treatment in Cloud environment employed in academic and
industrial contexts. The main purpose of elasticity is to reduce the deployment cost while …
