Bias optimality for continuous-time controlled Markov chains

X Guo, O Hernández-Lerma, X Guo… - 2009 - Springer

In Chap. 2, we formally introduce the concepts associated to a continuous time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …

被引用次数：453 相关文章所有 11 个版本

[PDF] academia.edu

Characterizations of overtaking optimality for controlled diffusion processes

H Jasso-Fuentes, O Hernández-Lerma - Applied Mathematics and …, 2008 - Springer

In this paper we give conditions for (the existence and) several characterizations of
overtaking optimal policies for a general class of controlled diffusion processes. Our …

被引用次数：94 相关文章所有 11 个版本

[图书][B] Selected topics on continuous-time controlled Markov chains and Markov games

T Prieto-Rumeau, O Hernández-Lerma - 2012 - books.google.com

This book concerns continuous-time controlled Markov chains, also known as continuous-
time Markov decision processes. They form a class of stochastic control problems in which a …

被引用次数：64 相关文章所有 7 个版本

[PDF] researchgate.net

A survey of recent results on continuous-time Markov decision processes

X Guo, O Hernández-Lerma, T Prieto-Rumeau, XR Cao… - Top, 2006 - Springer

This paper is a survey of recent results on continuous-time Markov decision processes
(MDPs) with unbounded transition rates, and reward rates that may be unbounded from …

被引用次数：85 相关文章所有 11 个版本

[HTML] scielo.org.mx

[HTML][HTML] Toma de decisiones de agentes racionales con procesos markovianos. Avances recientes en economía y finanzas

O Hernández-Lerma, F Venegas-Martínez - El trimestre económico, 2012 - scielo.org.mx

En esta investigación se revisa la evolución teórica y práctica de los procesos markovianos
y se resalta su rápido avance y notorio potencial en el modelado de los procesos de toma …

被引用次数：16 相关文章所有 15 个版本

[PDF] wayne.edu

On some ergodic impulse control problems with constraint

JL Menaldi, M Robin - SIAM Journal on Control and Optimization, 2018 - SIAM

This paper studies the impulse control of a general Markov process under the average (or
ergodic) cost when the impulse instants are restricted to be the arrival times of an …

被引用次数：24 相关文章所有 6 个版本

[PDF] researchgate.net

Bias and overtaking equilibria for zero-sum continuous-time Markov games

T Prieto-Rumeau, O Hernández-Lerma - Mathematical Methods of …, 2005 - Springer

This paper deals with continuous-time zero-sum two-person Markov games with
denumerable state space, general (Borel) action spaces and possibly unbounded transition …

被引用次数：54 相关文章所有 10 个版本

[PDF] researchgate.net

Ergodic control of continuous-time Markov chains with pathwise constraints

T Prieto-Rumeau, O Hernández-Lerma - SIAM journal on control and …, 2008 - SIAM

This paper deals with unichain ergodic continuous-time controlled Markov chains (CMCs)
with a denumerable state space and possibly unbounded reward rates as well as …

被引用次数：30 相关文章所有 10 个版本

Approximating ergodic average reward continuous-time controlled Markov chains

T Prieto-Rumeau, JÉMÍ Lorenzo - IEEE transactions on …, 2009 - ieeexplore.ieee.org

We study the approximation of an ergodic average reward continuous-time denumerable
state Markov decision process (MDP) by means of a sequence of MDPs. Our results include …

被引用次数：22 相关文章所有 5 个版本

[PDF] researchgate.net

The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains

T Prieto-Rumeau, O Hernández-Lerma - Mathematical Methods of …, 2005 - Springer

This paper gives conditions for the convergence of the Laurent series expansion for a class
of continuous-time controlled Markov chains with possibly unbounded reward (or cost) rates …

被引用次数：25 相关文章所有 10 个版本

高级搜索

QQ 群