相关文章- 学术资源搜索

[PDF][PDF] Configurable environments in reinforcement learning: An overview

AM Metelli - Special Topics in Information Technology, 2022 - library.oapen.org

Reinforcement Learning (RL) has emerged as an effective approach to address a variety of
complex control tasks. In a typical RL problem, an agent interacts with the environment by …

被引用次数：7 相关文章所有 11 个版本

[PDF] polimi.it

[图书][B] Exploiting environment configurability in reinforcement learning

AM Metelli - 2022 - books.google.com

In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to
address complex control tasks. In a Markov Decision Process (MDP), the framework typically …

被引用次数：5 相关文章所有 5 个版本

[PDF] mlr.press

Configurable Markov decision processes

AM Metelli, M Mutti, M Restelli - International Conference on …, 2018 - proceedings.mlr.press

In many real-world problems, there is the possibility to configure, to a limited extent, some
environmental parameters to improve the performance of a learning agent. In this paper, we …

被引用次数：47 相关文章所有 9 个版本

[PDF] mlr.press

Reinforcement learning in configurable continuous environments

AM Metelli, E Ghelfi, M Restelli - International Conference on …, 2019 - proceedings.mlr.press

Abstract Configurable Markov Decision Processes (Conf-MDPs) have been recently
introduced as an extension of the usual MDP model to account for the possibility of …

被引用次数：18 相关文章所有 11 个版本

[PDF] nsf.gov

[PDF][PDF] Policy learning with constraints in model-free reinforcement learning: A survey

Y Liu, A Halev, X Liu - The 30th international joint conference on artificial …, 2021 - par.nsf.gov

Reinforcement Learning (RL) algorithms have had tremendous success in simulated
domains. These algorithms, however, often cannot be directly applied to physical systems …

被引用次数：112 相关文章所有 6 个版本

[PDF] neurips.cc

Learning in non-cooperative configurable markov decision processes

G Ramponi, AM Metelli, A Concetti… - Advances in Neural …, 2021 - proceedings.neurips.cc

Abstract The Configurable Markov Decision Process framework includes two entities: a
Reinforcement Learning agent and a configurator that can modify some environmental …

被引用次数：11 相关文章所有 9 个版本

[PDF] arxiv.org

Environment Shaping in Reinforcement Learning using State Abstraction

P Kamalaruban, R Devidze, V Cevher… - arXiv preprint arXiv …, 2020 - arxiv.org

One of the central challenges faced by a reinforcement learning (RL) agent is to effectively
learn a (near-) optimal policy in environments with large state spaces having sparse and …

被引用次数：5 相关文章所有 3 个版本

[PDF] psu.edu

[PDF][PDF] Reinforcement learning with heterogeneous policy representations

P Kormushev, DG Caldwell - … Learning (EWRL 2013) held as a Dagstuhl …, 2013 - Citeseer

In Reinforcement Learning (RL) the goal is to find a policy π that maximizes the expected
future return, calculated based on a scalar reward function R (·)∈ R. The policy π …

被引用次数：5 相关文章所有 5 个版本

[PDF] springer.com

Policy space identification in configurable environments

AM Metelli, G Manneschi, M Restelli - Machine Learning, 2022 - Springer

We study the problem of identifying the policy space available to an agent in a learning
process, having access to a set of demonstrations generated by the agent playing the …

被引用次数：15 相关文章所有 11 个版本

[PDF] openreview.net

Robust domain randomization for reinforcement learning

RB Slaoui, WR Clements, JN Foerster, S Toth - 2019 - openreview.net

Producing agents that can generalize to a wide range of environments is a significant
challenge in reinforcement learning. One method for overcoming this issue is domain …

被引用次数：17 相关文章

高级搜索

QQ 群