EA Feinberg, A Jaśkiewicz, AS Nowak - Automatica, 2020 - Elsevier
We study discrete-time discounted constrained Markov decision processes (CMDPs) with Borel state and action spaces. These CMDPs satisfy either weak (W) continuity conditions …
A Piunovskiy, Y Zhang - arXiv preprint arXiv:2307.03158, 2023 - arxiv.org
In this paper, we consider a Markov decision process (MDP) with a Borel state space $\textbf {X}\cup\{\Delta\} $, where $\Delta $ is an absorbing state (cemetery), and a Borel action …
X Guo, M Vykertas, Y Zhang - Advances in Applied Probability, 2013 - cambridge.org
In this paper we study absorbing continuous-time Markov decision processes in Polish state spaces with unbounded transition and cost rates, and history-dependent policies. The …
X Wu, X Zou, X Guo - Frontiers of Mathematics in China, 2015 - Springer
This paper focuses on the constrained optimality problem (COP) of first passage discrete- time Markov decision processes (DTMDPs) in denumerable state and compact Borel action …
X Wu, Q Wang, Y Kong - AIMS Mathematics, 2021 - aimspress.com
In this paper, two-person zero-sum Markov games with Borel state space and action space, unbounded reward function and state-dependent discount factors are studied. The optimal …
Q Liu, X Huang - Optimization, 2017 - Taylor & Francis
In this paper, we deal with two-person zero-sum stochastic games for discrete-time Markov processes. The optimality criterion to be studied is the discounted payoff criterion during a …
X Wu, X Guo - Science China Mathematics, 2020 - Springer
This paper is concerned with the convergence of a sequence of discrete-time Markov decision processes (DTMDPs) with constraints, state-action dependent discount factors, and …
X Wu, Y Tang - Discrete Dynamics in Nature and Society, 2022 - Wiley Online Library
In this study, the numerical calculation of optimal policy pairs in two‐person zero‐sum stochastic games with unbounded reward functions and state‐dependent discount factors …
This paper is related to Markov Decision Processes. The optimal control problem is to minimize the expected total discounted cost, with a non-constant discount factor. The …