[HTML][HTML] Efficient Exploration and Robustness in Controlled Dynamical Systems

A Russo - 2023 - diva-portal.org
In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient
exploration in multi-task bandit models and model-free exploration in large Markov decision …

Task-optimal exploration in linear dynamical systems

AJ Wagenmaker, M Simchowitz… - … on Machine Learning, 2021 - proceedings.mlr.press
Exploration in unknown environments is a fundamental problem in reinforcement learning
and control. In this work, we study task-guided exploration and determine what precisely an …

[图书][B] Efficient model-based exploration in continuous state-space environments

A Nouri - 2011 - search.proquest.com
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about
the environment for the purpose of better decision making. As such, exploration plays a …

Model-free active exploration in reinforcement learning

A Russo, A Proutiere - Advances in Neural Information …, 2024 - proceedings.neurips.cc
We study the problem of exploration in Reinforcement Learning and present a novel model-
free solution. We adopt an information-theoretical viewpoint and start from the instance …

Exploration via planning for information about the optimal trajectory

V Mehta, I Char, J Abbate, R Conlin… - Advances in …, 2022 - proceedings.neurips.cc
Many potential applications of reinforcement learning (RL) are stymied by the large numbers
of samples required to learn an effective policy. This is especially true when applying RL to …

Directed exploration for improved sample efficiency in reinforcement learning

ZD Guo - 2019 - apps.dtic.mil
A key challenge in reinforcement learning is how an agent can efficiently gather useful
information about its environment to make the right decisions, ie, how can the agent be …

An experimental design perspective on model-based reinforcement learning

V Mehta, B Paria, J Schneider, S Ermon… - arXiv preprint arXiv …, 2021 - arxiv.org
In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …

Deterministic sequencing of exploration and exploitation for reinforcement learning

P Gupta, V Srivastava - … IEEE 61st Conference on Decision and …, 2022 - ieeexplore.ieee.org
We propose Deterministic Sequencing of Exploration and Exploitation (DSEE) algorithm
with interleaving exploration and exploitation epochs for model-based RL problems that aim …

Near-optimal policy identification in active reinforcement learning

X Li, V Mehta, J Kirschner, I Char… - arXiv preprint arXiv …, 2022 - arxiv.org
Many real-world reinforcement learning tasks require control of complex dynamical systems
that involve both costly data acquisition processes and large state spaces. In cases where …

Flex: an adaptive exploration algorithm for nonlinear systems

M Blanke, M Lelarge - International Conference on Machine …, 2023 - proceedings.mlr.press
Abstract Model-based reinforcement learning is a powerful tool, but collecting data to fit an
accurate model of the system can be costly. Exploring an unknown environment in a sample …