We present a reinforcement learning (RL) approach for robust optimization of risk-aware performance criteria. To allow agents to express a wide variety of risk-reward profiles, we …
We propose a novel framework to solve risk-sensitive reinforcement learning problems where the agent optimizes time-consistent dynamic spectral risk measures. Based on the …
We develop an approach for solving time‐consistent risk‐sensitive stochastic optimization problems using model‐free reinforcement learning (RL). Specifically, we assume agents …
Z Wu, R Xu - arXiv preprint arXiv:2311.13589, 2023 - arxiv.org
Reinforcement Learning (RL) has gained substantial attention across diverse application domains and theoretical investigations. Existing literature on RL theory largely focuses on …
X Yu, L Ying - International Conference on Machine …, 2023 - proceedings.mlr.press
Risk-sensitive reinforcement learning (RL) has become a popular tool to control the risk of uncertain outcomes and ensure reliable performance in various sequential decision-making …
W Ou, S Bi - arXiv preprint arXiv:2404.00940, 2024 - arxiv.org
This review paper provides an in-depth overview of the evolution and advancements in Robust Markov Decision Processes (RMDPs), a field of paramount importance for its role in …
Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their …
We consider the problem of estimating the asymptotic variance of a function defined on a Markov chain, an important step for statistical inference of the stationary mean. We design a …
S Chaudhary, U Dinesha, D Kalathil… - arXiv preprint arXiv …, 2025 - arxiv.org
We consider the challenge of mitigating the generation of negative or toxic content by the Large Language Models (LLMs) in response to certain prompts. We propose integrating risk …