[图书][B] Continuous-time Markov decision processes

X Guo, O Hernández-Lerma, X Guo… - 2009 - Springer
In Chap. 2, we formally introduce the concepts associated to a continuous time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …

Characterizations of overtaking optimality for controlled diffusion processes

H Jasso-Fuentes, O Hernández-Lerma - Applied Mathematics and …, 2008 - Springer
In this paper we give conditions for (the existence and) several characterizations of
overtaking optimal policies for a general class of controlled diffusion processes. Our …

[图书][B] Selected topics on continuous-time controlled Markov chains and Markov games

T Prieto-Rumeau, O Hernández-Lerma - 2012 - books.google.com
This book concerns continuous-time controlled Markov chains, also known as continuous-
time Markov decision processes. They form a class of stochastic control problems in which a …

A survey of recent results on continuous-time Markov decision processes

X Guo, O Hernández-Lerma, T Prieto-Rumeau, XR Cao… - Top, 2006 - Springer
This paper is a survey of recent results on continuous-time Markov decision processes
(MDPs) with unbounded transition rates, and reward rates that may be unbounded from …

[HTML][HTML] Toma de decisiones de agentes racionales con procesos markovianos. Avances recientes en economía y finanzas

O Hernández-Lerma, F Venegas-Martínez - El trimestre económico, 2012 - scielo.org.mx
En esta investigación se revisa la evolución teórica y práctica de los procesos markovianos
y se resalta su rápido avance y notorio potencial en el modelado de los procesos de toma …

On some ergodic impulse control problems with constraint

JL Menaldi, M Robin - SIAM Journal on Control and Optimization, 2018 - SIAM
This paper studies the impulse control of a general Markov process under the average (or
ergodic) cost when the impulse instants are restricted to be the arrival times of an …

Bias and overtaking equilibria for zero-sum continuous-time Markov games

T Prieto-Rumeau, O Hernández-Lerma - Mathematical Methods of …, 2005 - Springer
This paper deals with continuous-time zero-sum two-person Markov games with
denumerable state space, general (Borel) action spaces and possibly unbounded transition …

Ergodic control of continuous-time Markov chains with pathwise constraints

T Prieto-Rumeau, O Hernández-Lerma - SIAM journal on control and …, 2008 - SIAM
This paper deals with unichain ergodic continuous-time controlled Markov chains (CMCs)
with a denumerable state space and possibly unbounded reward rates as well as …

Approximating ergodic average reward continuous-time controlled Markov chains

T Prieto-Rumeau, JÉMÍ Lorenzo - IEEE transactions on …, 2009 - ieeexplore.ieee.org
We study the approximation of an ergodic average reward continuous-time denumerable
state Markov decision process (MDP) by means of a sequence of MDPs. Our results include …

The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains

T Prieto-Rumeau, O Hernández-Lerma - Mathematical Methods of …, 2005 - Springer
This paper gives conditions for the convergence of the Laurent series expansion for a class
of continuous-time controlled Markov chains with possibly unbounded reward (or cost) rates …