Convex analytic approach to constrained discounted Markov decision processes with non-constant...

A Piunovskiy, Y Zhang - SIAM Journal on Control and Optimization, 2024 - SIAM

In this paper, we consider a Markov decision process (MDP) with a Borel state space, where
is an absorbing state (cemetery), and a Borel action space. We consider the space of finite …

被引用次数：3 相关文章所有 4 个版本

[PDF] sciencedirect.com

Constrained discounted Markov decision processes with Borel state spaces

EA Feinberg, A Jaśkiewicz, AS Nowak - Automatica, 2020 - Elsevier

We study discrete-time discounted constrained Markov decision processes (CMDPs) with
Borel state and action spaces. These CMDPs satisfy either weak (W) continuity conditions …

被引用次数：20 相关文章所有 5 个版本

[PDF] arxiv.org

Extreme occupation measures in Markov decision processes with a cemetery

A Piunovskiy, Y Zhang - arXiv preprint arXiv:2307.03158, 2023 - arxiv.org

In this paper, we consider a Markov decision process (MDP) with a Borel state space $\textbf
{X}\cup\{\Delta\} $, where $\Delta $ is an absorbing state (cemetery), and a Borel action …

被引用次数：1 相关文章所有 2 个版本

[PDF] archive.org

Absorbing continuous-time Markov decision processes with total cost criteria

X Guo, M Vykertas, Y Zhang - Advances in Applied Probability, 2013 - cambridge.org

In this paper we study absorbing continuous-time Markov decision processes in Polish state
spaces with unbounded transition and cost rates, and history-dependent policies. The …

被引用次数：16 相关文章所有 8 个版本

[PDF] researchgate.net

First passage Markov decision processes with constraints and varying discount factors

X Wu, X Zou, X Guo - Frontiers of Mathematics in China, 2015 - Springer

This paper focuses on the constrained optimality problem (COP) of first passage discrete-
time Markov decision processes (DTMDPs) in denumerable state and compact Borel action …

被引用次数：7 相关文章所有 8 个版本

[PDF] aimspress.com

[PDF][PDF] Two-person zero-sum stochastic games with varying discount factors

X Wu, Q Wang, Y Kong - AIMS Mathematics, 2021 - aimspress.com

In this paper, two-person zero-sum Markov games with Borel state space and action space,
unbounded reward function and state-dependent discount factors are studied. The optimal …

被引用次数：2 相关文章所有 3 个版本

Discrete-time zero-sum Markov games with first passage criteria

Q Liu, X Huang - Optimization, 2017 - Taylor & Francis

In this paper, we deal with two-person zero-sum stochastic games for discrete-time Markov
processes. The optimality criterion to be studied is the discounted payoff criterion during a …

被引用次数：4 相关文章

Convergence of Markov decision processes with constraints and state-action dependent discount factors

X Wu, X Guo - Science China Mathematics, 2020 - Springer

This paper is concerned with the convergence of a sequence of discrete-time Markov
decision processes (DTMDPs) with constraints, state-action dependent discount factors, and …

被引用次数：3 相关文章

[PDF] wiley.com Full View

Numerical Calculation of Optimal Policy Pairs in Zero‐sum Stochastic Games with Varying Discount Factors

X Wu, Y Tang - Discrete Dynamics in Nature and Society, 2022 - Wiley Online Library

In this study, the numerical calculation of optimal policy pairs in two‐person zero‐sum
stochastic games with unbounded reward functions and state‐dependent discount factors …

Markov decision processes with time-varying discount factors and random horizon

R Ilhuicatzi-Roldán, H Cruz-Suárez… - Kybernetika, 2017 - dml.cz

This paper is related to Markov Decision Processes. The optimal control problem is to
minimize the expected total discounted cost, with a non-constant discount factor. The …

被引用次数：5 相关文章所有 10 个版本

高级搜索

QQ 群