Reinforcement learning in configurable continuous environments

AM Metelli, F Mazzolini, L Bisi… - International …, 2020 - proceedings.mlr.press

The choice of the control frequency of a system has a relevant impact on the ability of
reinforcement learning algorithms to learn a highly performing policy. In this paper, we …

Cited by 46 · Related articles · All 8 versions

[PDF] oapen.org

[PDF][PDF] Configurable environments in reinforcement learning: An overview

AM Metelli - Special Topics in Information Technology, 2022 - library.oapen.org

Reinforcement Learning (RL) has emerged as an effective approach to address a variety of
complex control tasks. In a typical RL problem, an agent interacts with the environment by …

Cited by 7 · Related articles · All 11 versions

[PDF] neurips.cc

Learning in non-cooperative configurable markov decision processes

G Ramponi, AM Metelli, A Concetti… - Advances in Neural …, 2021 - proceedings.neurips.cc

Abstract The Configurable Markov Decision Process framework includes two entities: a
Reinforcement Learning agent and a configurator that can modify some environmental …

Cited by 11 · Related articles · All 9 versions

[PDF] aaai.org

Online Markov Decision Processes Configuration with Continuous Decision Space

D Maran, P Olivieri, FE Stradi, G Urso, N Gatti… - Proceedings of the …, 2024 - ojs.aaai.org

In this paper, we investigate the optimal online configuration of episodic Markov decision
processes when the space of the possible configurations is continuous. Specifically, we …

Cited by 5 · Related articles

[PDF] springer.com

Policy space identification in configurable environments

AM Metelli, G Manneschi, M Restelli - Machine Learning, 2022 - Springer

We study the problem of identifying the policy space available to an agent in a learning
process, having access to a set of demonstrations generated by the agent playing the …

Cited by 15 · Related articles · All 11 versions

[PDF] polimi.it

[BOOK][B] Exploiting environment configurability in reinforcement learning

AM Metelli - 2022 - books.google.com

In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to
address complex control tasks. In a Markov Decision Process (MDP), the framework typically …

Cited by 5 · Related articles · All 5 versions

[PDF] arxiv.org
