Expected scalarised returns dominance: a new solution concept for multi-objective decision making

Authors

Conor Francis Hayes, Timothy Verstraeten, Diederik Marijn Roijers, Enda Howley, Patrick Mannion

Publication date

2022/7/5

Journal

Neural Computing and Applications

Description

In many real-world scenarios, the utility of a user is derived from a single execution of a policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the returns must be optimised. Various scenarios exist where a user’s preferences over objectives (also known as the utility function) are unknown or difficult to specify. In such scenarios, a set of optimal policies must be learned. However, settings where the expected utility must be maximised have been largely overlooked by the multi-objective reinforcement learning community and, as a consequence, a set of optimal solutions has yet to be defined. In this work, we propose first-order stochastic dominance as a criterion to build solution sets to maximise expected utility. We also define a new dominance criterion, known as expected scalarised returns (ESR) dominance, that extends first-order stochastic dominance to allow a set of optimal …
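As an illustration of the first-order stochastic dominance criterion mentioned above, here is a minimal sketch for the single-objective case only. It assumes each policy's return distribution is a finite discrete distribution stored as a `{value: probability}` dict; the names `cdf`, `fsd_dominates`, and the example policies are hypothetical and not taken from the paper. It uses the textbook definition (A dominates B when A's CDF is never above B's and is strictly below it somewhere) and does not implement the paper's multi-objective ESR dominance.

```python
def cdf(dist, v):
    """Return P(X <= v) for a discrete distribution {value: probability}."""
    return sum(p for x, p in dist.items() if x <= v)


def fsd_dominates(a, b, tol=1e-12):
    """True if `a` strictly first-order stochastically dominates `b`.

    Textbook univariate definition (an assumption of this sketch):
    F_a(v) <= F_b(v) for every v, with strict inequality for some v.
    For discrete distributions it suffices to check the joint support.
    """
    support = sorted(set(a) | set(b))
    strictly_better = False
    for v in support:
        fa, fb = cdf(a, v), cdf(b, v)
        if fa > fb + tol:      # a puts more mass on low returns here
            return False
        if fa < fb - tol:      # a is strictly better at this threshold
            strictly_better = True
    return strictly_better


# Hypothetical return distributions of three policies.
policy_a = {1.0: 0.2, 5.0: 0.8}
policy_b = {1.0: 0.5, 5.0: 0.5}
policy_c = {0.0: 0.1, 10.0: 0.9}

print(fsd_dominates(policy_a, policy_b))  # a shifts mass towards higher returns
print(fsd_dominates(policy_a, policy_c))  # CDFs cross, so neither dominates
print(fsd_dominates(policy_c, policy_a))
```

Under this definition, a dominating distribution has expected utility at least as high for every monotonically increasing utility function, which is why pairs whose CDFs cross (such as `policy_a` and `policy_c` here) would both be kept in a solution set when the utility function is unknown.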

Total citations

Cited by 16

Citations per year: 2021: 1; 2022: 4; 2023: 7; 2024: 4
