Model-based reinforcement learning in continuous environments using real-time constrained optimization

O Andersson, F Heintz, P Doherty - … of the AAAI Conference on Artificial …, 2015 - ojs.aaai.org
… robot as well as constraints imposed for its safe operation. In this paper we propose a
modelbased reinforcement learning approach for continuous environments with constraints. The …

Reinforcement learning for an intelligent and autonomous production control of complex job-shops under time constraints

T Altenmüller, T Stüker, B Waschneck, A Kuhnle… - Production …, 2020 - Springer
… respect to time-constraints beats heuristic … self-learning agent can successfully manage
time constraints with the agent performing better than the traditional benchmark, a time-constraint

Policy learning with constraints in model-free reinforcement learning: A survey

Y Liu, A Halev, X Liu - The 30th international joint conference on artificial …, 2021 - par.nsf.gov
time constraints. … constraints: cumulative and instantaneous. In order to evaluate algorithm
performance, we briefly introduce benchmarks and metrics for constrained reinforcement learn

Improving learning & reducing time: A constrained action-based reinforcement learning approach

S Shen, MS Ausin, B Mostafavi, M Chi - … of the 26th conference on user …, 2018 - dl.acm.org
… expected cumulative reward while obeying the action-based constraints. For example, the
constraints in our application limit the total number of times that PS and WE can be selected at …

RTP-Q: A Reinforcement Learning System with Time Constraints Exploration Planning for Accelerating the Learning Rate

G Zhao, S Tatsumi, R Sun - IEICE transactions on fundamentals of …, 1999 - search.ieice.org
learning and planning with constrained time, however, the exploration is not active. This
paper proposes a RTP-Q reinforcement learning … environment into time constraints exploration …

[图书][B] Reinforcement learning under space and time constraints

HH van Seijen - 2011 - researchgate.net
… The time constraint in the title of this thesis reflects this idea; it refers to the computational
time available in between observing a sample and selecting the next action, ie, the time

Scalable multi-product inventory control with lead time constraints using reinforcement learning

H Meisheri, NN Sultana, M Baranwal, V Baniwal… - Neural Computing and …, 2022 - Springer
reinforcement learning (RL) approach for a multi-product inventory control problem with different
lead times… We also develop the equivalent reinforcement learning formulation, while the …

Optimal options for multi-task reinforcement learning under time constraints

M Del Verme, BC Da Silva, G Baldassarre - arXiv preprint arXiv …, 2020 - arxiv.org
Reinforcement learning can greatly benefit from the use of options as a way of encoding
recurring behaviours and to foster exploration. An important open problem is how can an agent …

Reinforcement learning for assignment problem with time constraints

S Pathan, V Shrivastava - arXiv preprint arXiv:2106.02856, 2021 - arxiv.org
… The reinforcement learning agent learns this policy to … with fixed time constraints using
reinforcement learning to plan … dynamic elements, and time constraints within which the nodes (…

Logically-constrained reinforcement learning

M Hasanbeig, A Abate, D Kroening - arXiv preprint arXiv:1801.08099, 2018 - arxiv.org
Reinforcement Learning (RL) algorithm to synthesise policies for an unknown Markov Decision
Process (MDP), such that a linear time … using real-time constrained optimization. In: AAAI. …