A probabilistic analysis of bias optimality in unichain Markov decision processes

IEEE Transactions on Automatic Control, 2001 - ieeexplore.ieee.org
Focuses on bias optimality in unichain, finite state, and action-space Markov decision
processes. Using relative value functions, we present methods for evaluating optimal bias …

[PDF][PDF] A Probabilistic Analysis of Bias Optimality in Unichain Markov Decision Processes123

ME Lewis, ML Puterman - people.orie.cornell.edu
This paper focuses on bias optimality in unichain, finite state and action space Markov
Decision Processes. Using relative value functions, we present new methods for evaluating …

[PDF][PDF] A Probabilistic Analysis of Bias Optimality in Unichain Markov Decision Processes y

ME Lewis, ML Puterman - researchgate.net
Since the long-run average reward optimality criterion is underselective, a decisionmaker
often uses bias to distinguish between multiple average optimal policies. We study bias …

[PDF][PDF] A Probabilistic Analysis of Bias Optimality in Unichain Markov Decision Processes y

ME Lewis, ML Puterman - scholar.archive.org
Since the long-run average reward optimality criterion is underselective, a decisionmaker
often uses bias to distinguish between multiple average optimal policies. We study bias …

[PDF][PDF] A Probabilistic Analysis of Bias Optimality in Unichain Markov Decision Processes y

ME Lewis, ML Puterman - Citeseer
Since the long-run average reward optimality criterion is underselective, a decisionmaker
often uses bias to distinguish between multiple average optimal policies. We study bias …

[PDF][PDF] A Probabilistic Analysis of Bias Optimality in Unichain Markov Decision Processes123

ME Lewis, ML Puterman - people.orie.cornell.edu
This paper focuses on bias optimality in unichain, finite state and action space Markov
Decision Processes. Using relative value functions, we present new methods for evaluating …