[PDF][PDF] Configurable environments in reinforcement learning: An overview

AM Metelli - Special Topics in Information Technology, 2022 - library.oapen.org
Reinforcement Learning (RL) has emerged as an effective approach to address a variety of
complex control tasks. In a typical RL problem, an agent interacts with the environment by …

[图书][B] Exploiting environment configurability in reinforcement learning

AM Metelli - 2022 - books.google.com
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to
address complex control tasks. In a Markov Decision Process (MDP), the framework typically …

Configurable Markov decision processes

AM Metelli, M Mutti, M Restelli - International Conference on …, 2018 - proceedings.mlr.press
In many real-world problems, there is the possibility to configure, to a limited extent, some
environmental parameters to improve the performance of a learning agent. In this paper, we …

Reinforcement learning in configurable continuous environments

AM Metelli, E Ghelfi, M Restelli - International Conference on …, 2019 - proceedings.mlr.press
Abstract Configurable Markov Decision Processes (Conf-MDPs) have been recently
introduced as an extension of the usual MDP model to account for the possibility of …

[PDF][PDF] Policy learning with constraints in model-free reinforcement learning: A survey

Y Liu, A Halev, X Liu - The 30th international joint conference on artificial …, 2021 - par.nsf.gov
Reinforcement Learning (RL) algorithms have had tremendous success in simulated
domains. These algorithms, however, often cannot be directly applied to physical systems …

Learning in non-cooperative configurable markov decision processes

G Ramponi, AM Metelli, A Concetti… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract The Configurable Markov Decision Process framework includes two entities: a
Reinforcement Learning agent and a configurator that can modify some environmental …

Environment Shaping in Reinforcement Learning using State Abstraction

P Kamalaruban, R Devidze, V Cevher… - arXiv preprint arXiv …, 2020 - arxiv.org
One of the central challenges faced by a reinforcement learning (RL) agent is to effectively
learn a (near-) optimal policy in environments with large state spaces having sparse and …

[PDF][PDF] Reinforcement learning with heterogeneous policy representations

P Kormushev, DG Caldwell - … Learning (EWRL 2013) held as a Dagstuhl …, 2013 - Citeseer
In Reinforcement Learning (RL) the goal is to find a policy π that maximizes the expected
future return, calculated based on a scalar reward function R (·)∈ R. The policy π …

Policy space identification in configurable environments

AM Metelli, G Manneschi, M Restelli - Machine Learning, 2022 - Springer
We study the problem of identifying the policy space available to an agent in a learning
process, having access to a set of demonstrations generated by the agent playing the …

Robust domain randomization for reinforcement learning

RB Slaoui, WR Clements, JN Foerster, S Toth - 2019 - openreview.net
Producing agents that can generalize to a wide range of environments is a significant
challenge in reinforcement learning. One method for overcoming this issue is domain …