Control frequency adaptation via action persistence in batch reinforcement learning

AM Metelli, F Mazzolini, L Bisi… - International …, 2020 - proceedings.mlr.press
The choice of the control frequency of a system has a relevant impact on the ability of
reinforcement learning algorithms to learn a highly performing policy. In this paper, we …

[PDF][PDF] Configurable environments in reinforcement learning: An overview

AM Metelli - Special Topics in Information Technology, 2022 - library.oapen.org
Reinforcement Learning (RL) has emerged as an effective approach to address a variety of
complex control tasks. In a typical RL problem, an agent interacts with the environment by …

Learning in non-cooperative configurable markov decision processes

G Ramponi, AM Metelli, A Concetti… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract The Configurable Markov Decision Process framework includes two entities: a
Reinforcement Learning agent and a configurator that can modify some environmental …

Online Markov Decision Processes Configuration with Continuous Decision Space

D Maran, P Olivieri, FE Stradi, G Urso, N Gatti… - Proceedings of the …, 2024 - ojs.aaai.org
In this paper, we investigate the optimal online configuration of episodic Markov decision
processes when the space of the possible configurations is continuous. Specifically, we …

Policy space identification in configurable environments

AM Metelli, G Manneschi, M Restelli - Machine Learning, 2022 - Springer
We study the problem of identifying the policy space available to an agent in a learning
process, having access to a set of demonstrations generated by the agent playing the …

[图书][B] Exploiting environment configurability in reinforcement learning

AM Metelli - 2022 - books.google.com
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to
address complex control tasks. In a Markov Decision Process (MDP), the framework typically …

Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs

R Poiani, C Stirbu, AM Metelli… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the continuous growth of the global economy and markets, resource imbalance has
risen to be one of the central issues in real logistic scenarios. In marine transportation, this …

Performance Improvement Bounds for Lipschitz Configurable Markov Decision Processes

AM Metelli - arXiv preprint arXiv:2402.13821, 2024 - arxiv.org
Configurable Markov Decision Processes (Conf-MDPs) have recently been introduced as an
extension of the traditional Markov Decision Processes (MDPs) to model the real-world …

Towards alignment of Reinforcement Learning agents; for consideration of safety, robustness and fairness.

H Satija - 2024 - escholarship.mcgill.ca
Reinforcement Learning (RL) has emerged as the standard paradigm for sequential
decision-making and a framework for general intelligence. At its core, the RL problem is one …

A unified view of configurable Markov Decision Processes: Solution concepts, value functions, and operators

AM Metelli - Intelligenza Artificiale, 2022 - content.iospress.com
In this paper, we provide a unified presentation of the Configurable Markov Decision
Process (Conf-MDP) framework. A Conf-MDP is an extension of the traditional Markov …