Ergodic risk-sensitive control—a survey

A Biswas, VS Borkar - Annual Reviews in Control, 2023 - Elsevier
Risk-sensitive control has received considerable interest since the seminal work of Howard
and Matheson (Howard and Matheson, 1971/72) because of its ability to account for …

A unified view of entropy-regularized markov decision processes

G Neu, A Jonsson, V Gómez - arXiv preprint arXiv:1705.07798, 2017 - arxiv.org
We propose a general framework for entropy-regularized average-reward reinforcement
learning in Markov decision processes (MDPs). Our approach is based on extending the …

Safe exploration in markov decision processes

TM Moldovan, P Abbeel - arXiv preprint arXiv:1205.4810, 2012 - arxiv.org
In environments with uncertain dynamics exploration is necessary to learn how to perform
well. Existing reinforcement learning algorithms provide strong exploration guarantees, but …

Exponential bellman equation and improved regret bounds for risk-sensitive reinforcement learning

Y Fei, Z Yang, Y Chen, Z Wang - Advances in neural …, 2021 - proceedings.neurips.cc
We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure.
Although existing works have established non-asymptotic regret guarantees for this …

On stochastic optimal control and reinforcement learning by approximate inference

K Rawlik, M Toussaint, S Vijayakumar - 2013 - direct.mit.edu
We present a reformulation of the stochastic optimal control problem in terms of KL
divergence minimisation, not only providing a unifying perspective of previous approaches …

Ideologies, values, attitudes, and behavior

GR Maio, JM Olson, MM Bernard, MA Luke - Handbook of social …, 2006 - Springer
History is replete with cases wiiere people have worked hard as individuals and groups for
causes they regarded as important; these efforts have ranged in intensity from simple …

Risk-sensitive reinforcement learning with function approximation: A debiasing approach

Y Fei, Z Yang, Z Wang - International Conference on …, 2021 - proceedings.mlr.press
We study function approximation for episodic reinforcement learning with entropic risk
measure. We first propose an algorithm with linear function approximation. Compared to …

Robust, risk-sensitive, and data-driven control of Markov decision processes

Y Le Tallec - 2007 - dspace.mit.edu
Markov Decision Processes (MDPs) model problems of sequential decision-making under
uncertainty. They have been studied and applied extensively. Nonetheless, there are two …

Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret

Y Fei, Z Yang, Y Chen, Z Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc
We study risk-sensitive reinforcement learning in episodic Markov decision processes with
unknown transition kernels, where the goal is to optimize the total reward under the risk …

Regret bounds for markov decision processes with recursive optimized certainty equivalents

W Xu, X Gao, X He - International Conference on Machine …, 2023 - proceedings.mlr.press
The optimized certainty equivalent (OCE) is a family of risk measures that cover important
examples such as entropic risk, conditional value-at-risk and mean-variance models. In this …