A survey of recent results on continuous-time Markov decision processes

X Guo, O Hernández-Lerma, X Guo… - 2009 - Springer

In Chap. 2, we formally introduce the concepts associated to a continuous time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …

Cited by 453 · Related articles · All 11 versions

[PDF] projecteuclid.org

Asymptotically optimal priority policies for indexable and nonindexable restless bandits

IM Verloop - 2016 - projecteuclid.org

We study the asymptotic optimal control of multi-class restless bandits. A restless bandit is a
controllable stochastic process whose state evolution depends on whether or not the bandit …

Cited by 108 · Related articles · All 15 versions

Continuous-time Markov decision processes

A Piunovskiy, Y Zhang - Probability Theory and Stochastic Modelling, 2020 - Springer

The study of continuous-time Markov decision processes dates back at least to the 1950s,
shortly after that of its discrete-time analogue. Since then, the theory has rapidly developed …

Cited by 34 · Related articles · All 5 versions

[PDF] vu.nl

Dynamic control of a single-server system with abandonments

DG Down, G Koole, ME Lewis - Queueing Systems, 2011 - Springer

In this paper, we discuss the dynamic server control in a two-class service system with
abandonments. Two models are considered. In the first case, rewards are received upon …

Cited by 87 · Related articles · All 17 versions

[PDF] arxiv.org

Risk-sensitive control of continuous time Markov chains

MK Ghosh, S Saha - … An International Journal of Probability and …, 2014 - Taylor & Francis

We study risk-sensitive control of continuous time Markov chains taking values in discrete
state space. We study both finite and infinite horizon problems. In the finite horizon problem …

Cited by 61 · Related articles · All 9 versions

[BOOK][B] Continuous average control of piecewise deterministic Markov processes

OL do Valle Costa, F Dufour - 2013 - Springer

The intent of this book is to present recent results in the control theory for the long-run
average continuous control problem of Piecewise Deterministic Markov Processes (PDMPs) …

Cited by 70 · Related articles · All 9 versions

Discounted continuous-time Markov decision processes with constraints: unbounded transition and loss rates

X Guo, A Piunovskiy - Mathematics of Operations Research, 2011 - pubsonline.informs.org

This paper deals with denumerable continuous-time Markov decision processes (MDP) with
constraints. The optimality criterion to be minimized is expected discounted loss, while …

Cited by 74 · Related articles · All 6 versions

[PDF] utwente.nl

Uniformization: Basics, extensions and applications

NM van Dijk, SPJ van Brummelen, RJ Boucherie - Performance evaluation, 2018 - Elsevier

Uniformization, also referred to as randomization, is a well-known performance evaluation
technique to model and analyse continuous-time Markov chains via an easier to …

Cited by 40 · Related articles · All 5 versions

Discounted continuous-time Markov decision processes with unbounded rates: the convex analytic approach

A Piunovskiy, Y Zhang - SIAM journal on control and optimization, 2011 - SIAM

This paper deals with constrained discounted continuous-time Markov decision processes,
also known as controlled jump Markov processes, with Borel state and action spaces. Under …

Cited by 66 · Related articles · All 6 versions

[BOOK][B] Selected topics on continuous-time controlled Markov chains and Markov games

T Prieto-Rumeau, O Hernández-Lerma - 2012 - books.google.com

This book concerns continuous-time controlled Markov chains, also known as continuous-
time Markov decision processes. They form a class of stochastic control problems in which a …

Cited by 64 · Related articles · All 7 versions
