Gap-increasing policy evaluation for efficient and noise-tolerant reinforcement learning

文章

学术资源搜索

获得 3 条结果（用时0.03秒）

Gap-increasing policy evaluation for efficient and noise-tolerant reinforcement learning

Deep reinforcement learning based energy management strategies for electrified vehicles: Recent advances and perspectives

H He, X Meng, Y Wang, A Khajepour, X An… - … and Sustainable Energy …, 2024 - Elsevier

Electrified vehicles provide an effective solution to address the unfavorable impacts of fossil
fuel use in the transportation sector. Energy management strategy (EMS) is the core …

被引用次数：16 相关文章所有 4 个版本

[PDF] mlr.press

VA-learning as a more efficient alternative to Q-learning

Y Tang, R Munos, M Rowland… - … Conference on Machine …, 2023 - proceedings.mlr.press

In reinforcement learning, the advantage function is critical for policy improvement, but is
often extracted from a learned Q-function. A natural question is: Why not learn the advantage …

被引用次数：3 相关文章所有 6 个版本

[PDF] aaai.org

Robust Action Gap Increasing with Clipped Advantage Learning

Z Zhang, Y Gan, X Tan - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org

Advantage Learning (AL) seeks to increase the action gap between the optimal action and
its competitors, so as to improve the robustness to estimation errors. However, the method …

高级搜索

QQ 群

Gap-increasing policy evaluation for efficient and noise-tolerant reinforcement learning

Deep reinforcement learning based energy management strategies for electrified vehicles: Recent advances and perspectives

VA-learning as a more efficient alternative to Q-learning

Robust Action Gap Increasing with Clipped Advantage Learning

引用