A probabilistic analysis of bias optimality in unichain Markov decision processes

ME Lewis, ML Puterman - IEEE Transactions on Automatic …, 2001 - ieeexplore.ieee.org

IEEE Transactions on Automatic Control, 2001 • ieeexplore.ieee.org

This paper focuses on bias optimality in unichain Markov decision processes with finite
state and action spaces. Using relative value functions, we present methods for evaluating
the optimal bias. This leads to a probabilistic analysis that transforms the original reward
problem into a minimum average cost problem. The result is an explanation of how and why
bias implicitly discounts future rewards.

Cited by: 49 · All 7 versions
