相关文章- 学术资源搜索

[HTML][HTML] Efficient Exploration and Robustness in Controlled Dynamical Systems

A Russo - 2023 - diva-portal.org

In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient
exploration in multi-task bandit models and model-free exploration in large Markov decision …

被引用次数：1 相关文章所有 2 个版本

[PDF] mlr.press

Task-optimal exploration in linear dynamical systems

AJ Wagenmaker, M Simchowitz… - … on Machine Learning, 2021 - proceedings.mlr.press

Exploration in unknown environments is a fundamental problem in reinforcement learning
and control. In this work, we study task-guided exploration and determine what precisely an …

被引用次数：15 相关文章所有 5 个版本

[PDF] rutgers.edu

[图书][B] Efficient model-based exploration in continuous state-space environments

A Nouri - 2011 - search.proquest.com

The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about
the environment for the purpose of better decision making. As such, exploration plays a …

被引用次数：6 相关文章所有 5 个版本

[PDF] neurips.cc

Model-free active exploration in reinforcement learning

A Russo, A Proutiere - Advances in Neural Information …, 2024 - proceedings.neurips.cc

We study the problem of exploration in Reinforcement Learning and present a novel model-
free solution. We adopt an information-theoretical viewpoint and start from the instance …

被引用次数：1 相关文章所有 6 个版本

[PDF] neurips.cc

Exploration via planning for information about the optimal trajectory

V Mehta, I Char, J Abbate, R Conlin… - Advances in …, 2022 - proceedings.neurips.cc

Many potential applications of reinforcement learning (RL) are stymied by the large numbers
of samples required to learn an effective policy. This is especially true when applying RL to …

被引用次数：8 相关文章所有 5 个版本

[PDF] dtic.mil

Directed exploration for improved sample efficiency in reinforcement learning

ZD Guo - 2019 - apps.dtic.mil

A key challenge in reinforcement learning is how an agent can efficiently gather useful
information about its environment to make the right decisions, ie, how can the agent be …

被引用次数：1 相关文章所有 6 个版本

[PDF] arxiv.org

An experimental design perspective on model-based reinforcement learning

V Mehta, B Paria, J Schneider, S Ermon… - arXiv preprint arXiv …, 2021 - arxiv.org

In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …

被引用次数：26 相关文章所有 3 个版本

[PDF] arxiv.org

Deterministic sequencing of exploration and exploitation for reinforcement learning

P Gupta, V Srivastava - … IEEE 61st Conference on Decision and …, 2022 - ieeexplore.ieee.org

We propose Deterministic Sequencing of Exploration and Exploitation (DSEE) algorithm
with interleaving exploration and exploitation epochs for model-based RL problems that aim …

被引用次数：4 相关文章所有 6 个版本

[PDF] arxiv.org

Near-optimal policy identification in active reinforcement learning

X Li, V Mehta, J Kirschner, I Char… - arXiv preprint arXiv …, 2022 - arxiv.org

Many real-world reinforcement learning tasks require control of complex dynamical systems
that involve both costly data acquisition processes and large state spaces. In cases where …

被引用次数：6 相关文章所有 3 个版本

[PDF] mlr.press

Flex: an adaptive exploration algorithm for nonlinear systems

M Blanke, M Lelarge - International Conference on Machine …, 2023 - proceedings.mlr.press

Abstract Model-based reinforcement learning is a powerful tool, but collecting data to fit an
accurate model of the system can be costly. Exploring an unknown environment in a sample …

被引用次数：4 相关文章所有 14 个版本

高级搜索

QQ 群

[HTML][HTML] Efficient Exploration and Robustness in Controlled Dynamical Systems

Task-optimal exploration in linear dynamical systems

[图书][B] Efficient model-based exploration in continuous state-space environments

Model-free active exploration in reinforcement learning

Exploration via planning for information about the optimal trajectory

Directed exploration for improved sample efficiency in reinforcement learning

An experimental design perspective on model-based reinforcement learning

Deterministic sequencing of exploration and exploitation for reinforcement learning

Near-optimal policy identification in active reinforcement learning

Flex: an adaptive exploration algorithm for nonlinear systems

引用