time constraints reinforcement learning- 学术资源搜索

Reinforcement learning-based finite-time tracking control of an unknown unmanned surface vehicle with input constraints

N Wang, Y Gao, C Yang, X Zhang - Neurocomputing, 2022 - Elsevier

… input constraints, a reinforcement learning-based finite-time … -critic reinforcement learning
(RL) mechanism with finite-time control … which requires infinite-time convergence thereby rather …

被引用次数：45 相关文章所有 2 个版本

[PDF] arxiv.org

State augmented constrained reinforcement learning: Overcoming the limitations of learning with rewards

M Calvo-Fullana, S Paternain… - … on Automatic Control, 2023 - ieeexplore.ieee.org

… Hence, there exist constrained reinforcement learning … to solve reinforcement learning
problems with constraints. Thus, as we … 1 that this time average should satisfy the constraints with …

被引用次数：22 相关文章所有 3 个版本

[PDF] neurips.cc

Reinforcement learning with convex constraints

S Miryoosefi, K Brantley, H Daume III… - Advances in neural …, 2019 - proceedings.neurips.cc

… In standard reinforcement learning (RL), a learning agent … are more naturally expressed
as constraints. For instance, the … of constraints in RL tasks: specifically, any constraints that …

被引用次数：98 相关文章所有 18 个版本

[PDF] mlr.press

Anytime-Constrained Reinforcement Learning

J McMahan, X Zhu - International Conference on Artificial …, 2024 - proceedings.mlr.press

… Mowbray et al., 2022) yield policies that arbitrarily violate an anytime constraint. This …
blowup for anytime constraints is in the time horizon, we focus on varying the time horizon. …

被引用次数：2 相关文章所有 4 个版本

Reinforcement learning for a class of continuous-time input constrained optimal control problems

FA Yaghmaie, DJ Braun - Automatica, 2019 - Elsevier

… Reinforcement Learning frameworks solving HJB equations … free Integral Reinforcement
Learning (IRL) framework to learn the … policy u ϵ ∗ ( x ) where the control is subject to constraint. …

被引用次数：34 相关文章所有 2 个版本

[PDF] neurips.cc

Constrained reinforcement learning has zero duality gap

S Paternain, L Chamon… - Advances in Neural …, 2019 - proceedings.neurips.cc

… as completing tasks using the least amount of time/energy, learning multiple tasks, or dealing
with multiple opponents. In the context of reinforcement learning (RL), these problems are …

被引用次数：179 相关文章所有 8 个版本

Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints

D Liu, X Yang, D Wang, Q Wei - IEEE transactions on …, 2015 - ieeexplore.ieee.org

… The constrained-input coupled with the inability to identify … of stabilizing controller based
on reinforcement-learning (RL) … -time uncertain nonlinear systems subject to input constraints. …

被引用次数：345 相关文章所有 4 个版本

[PDF] sxu.edu.cn

A reinforcement learning method for constraint-satisfied services composition

L Ren, W Wang, H Xu - IEEE Transactions on Services …, 2017 - ieeexplore.ieee.org

… is only for service level setting, and during the selection of component service we take the
constraints of composite service as a whole. Here, taking the response time constraint as an …

被引用次数：40 相关文章所有 6 个版本

[PDF] jair.org

Risk-sensitive reinforcement learning applied to control under constraints

P Geibel, F Wysotzki - Journal of Artificial Intelligence Research, 2005 - jair.org

… , and formalize it as a constrained MDP with two criteria. The … We present a model free,
heuristic reinforcement learning … a feasible solution for the constrained problem that has a good …

被引用次数：430 相关文章所有 19 个版本

[PDF] arxiv.org

Safety-constrained reinforcement learning for MDPs

S Junges, N Jansen, C Dehnert, U Topcu… - … conference on tools and …, 2016 - Springer

… reinforcement learning is Q-learning [10]. To favor the exploration, we initialize the learning
… The computation time for simulating deployment and reinforcement learning are negligible. …

被引用次数：139 相关文章所有 10 个版本

高级搜索

QQ 群