Extreme occupation measures in Markov decision processes with an absorbing state

A Piunovskiy, Y Zhang - SIAM Journal on Control and Optimization, 2024 - SIAM
In this paper, we consider a Markov decision process (MDP) with a Borel state space, where
is an absorbing state (cemetery), and a Borel action space. We consider the space of finite …

Constrained discounted Markov decision processes with Borel state spaces

EA Feinberg, A Jaśkiewicz, AS Nowak - Automatica, 2020 - Elsevier
We study discrete-time discounted constrained Markov decision processes (CMDPs) with
Borel state and action spaces. These CMDPs satisfy either weak (W) continuity conditions …

Extreme occupation measures in Markov decision processes with a cemetery

A Piunovskiy, Y Zhang - arXiv preprint arXiv:2307.03158, 2023 - arxiv.org
In this paper, we consider a Markov decision process (MDP) with a Borel state space $\textbf
{X}\cup\{\Delta\} $, where $\Delta $ is an absorbing state (cemetery), and a Borel action …

Absorbing continuous-time Markov decision processes with total cost criteria

X Guo, M Vykertas, Y Zhang - Advances in Applied Probability, 2013 - cambridge.org
In this paper we study absorbing continuous-time Markov decision processes in Polish state
spaces with unbounded transition and cost rates, and history-dependent policies. The …

First passage Markov decision processes with constraints and varying discount factors

X Wu, X Zou, X Guo - Frontiers of Mathematics in China, 2015 - Springer
This paper focuses on the constrained optimality problem (COP) of first passage discrete-
time Markov decision processes (DTMDPs) in denumerable state and compact Borel action …

[PDF][PDF] Two-person zero-sum stochastic games with varying discount factors

X Wu, Q Wang, Y Kong - AIMS Mathematics, 2021 - aimspress.com
In this paper, two-person zero-sum Markov games with Borel state space and action space,
unbounded reward function and state-dependent discount factors are studied. The optimal …

Discrete-time zero-sum Markov games with first passage criteria

Q Liu, X Huang - Optimization, 2017 - Taylor & Francis
In this paper, we deal with two-person zero-sum stochastic games for discrete-time Markov
processes. The optimality criterion to be studied is the discounted payoff criterion during a …

Convergence of Markov decision processes with constraints and state-action dependent discount factors

X Wu, X Guo - Science China Mathematics, 2020 - Springer
This paper is concerned with the convergence of a sequence of discrete-time Markov
decision processes (DTMDPs) with constraints, state-action dependent discount factors, and …

Numerical Calculation of Optimal Policy Pairs in Zero‐sum Stochastic Games with Varying Discount Factors

X Wu, Y Tang - Discrete Dynamics in Nature and Society, 2022 - Wiley Online Library
In this study, the numerical calculation of optimal policy pairs in two‐person zero‐sum
stochastic games with unbounded reward functions and state‐dependent discount factors …

Markov decision processes with time-varying discount factors and random horizon

R Ilhuicatzi-Roldán, H Cruz-Suárez… - Kybernetika, 2017 - dml.cz
This paper is related to Markov Decision Processes. The optimal control problem is to
minimize the expected total discounted cost, with a non-constant discount factor. The …