Two steps to risk sensitivity

Q Wang, Y Lv, Z Xie, J Huang - Advances in Neural …, 2024 - proceedings.neurips.cc

Meta learning is a promising paradigm to enable skill transfer across tasks. Most previous
methods employ the empirical risk minimization principle in optimization. However, the …

被引用次数：3 相关文章所有 5 个版本

[PDF] neurips.cc

One risk to rule them all: A risk-sensitive perspective on model-based offline reinforcement learning

M Rigter, B Lacerda, N Hawes - Advances in Neural …, 2024 - proceedings.neurips.cc

Offline reinforcement learning (RL) is suitable for safety-critical domains where online
exploration is not feasible. In such domains, decision-making should take into consideration …

被引用次数：8 相关文章所有 7 个版本

[PDF] acm.org

adaparl: Adaptive privacy-aware reinforcement learning for sequential decision making human-in-the-loop systems

M Taherisadr, SA Stavroulakis, S Elmalaki - Proceedings of the 8th ACM …, 2023 - dl.acm.org

Reinforcement learning (RL) presents numerous benefits compared to rule-based
approaches in various applications. Privacy concerns have grown with the widespread use …

被引用次数：12 相关文章所有 5 个版本

[PDF] osf.io

Peril, prudence and planning as risk, avoidance and worry

C Gagne, P Dayan - Journal of Mathematical Psychology, 2022 - Elsevier

Risk occupies a central role in both the theory and practice of decision-making. Although it is
deeply implicated in many conditions involving dysfunctional behavior and thought, modern …

被引用次数：19 相关文章所有 13 个版本

[PDF] sagepub.com

People Place Larger Bets When Risky Choices Provide a Postbet Option to Cash Out

D Bennett, L Albertella, L Forbes… - Psychological …, 2023 - journals.sagepub.com

After a risky choice, decision makers must frequently wait out a delay period before the
outcome of their choice becomes known. In contemporary sports-betting apps, decision …

Risk-sensitive and robust model-based reinforcement learning and planning

M Rigter - arXiv preprint arXiv:2304.00573, 2023 - arxiv.org

Many sequential decision-making problems that are currently automated, such as those in
manufacturing or recommender systems, operate in an environment where there is either …

ACReL: Adversarial Conditional value-at-risk Reinforcement Learning

M Godbout, M Heuillet, S Chandra, R Bhati… - arXiv preprint arXiv …, 2021 - arxiv.org

In the classical Reinforcement Learning (RL) setting, one aims to find a policy that
maximizes its expected return. This objective may be inappropriate in safety-critical domains …

[PDF][PDF] Exploring Uncertainty in Distributional Reinforcement Learning

G Antonov, P Dayan - Reinforcement Learning Conference (RLC …, 2024 - rlj.cs.umass.edu

Epistemic uncertainty, which stems from what a learning algorithm does not know, is the
natural signal for exploration. Capturing and exploiting epistemic uncertainty for efficient …

[PDF][PDF] Exploring Optimal Risk-Sensitive Behavior in the Balloon Analogue Risk Task (BART)

X Sui, P Dayan, K Lloyd - Computational Psychiatry Conference …, 2023 - 2024.ccneuro.org

Attitudes towards risk play a crucial role in everyday decision-making as well as in
psychiatric disorders like anxiety. The Balloon Analogue Risk Task (BART) provides a …

[PDF] arxiv.org

Catastrophe, Compounding & Consistency in Choice

C Gagne, P Dayan - arXiv preprint arXiv:2111.06804, 2021 - arxiv.org

Conditional value-at-risk (CVaR) precisely characterizes the influence that rare, catastrophic
events can exert over decisions. Such characterizations are important for both normal …

高级搜索

QQ 群

A simple yet effective strategy to robustify the meta learning paradigm