[HTML][HTML] A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Sample-efficient multi-objective learning via generalized policy improvement prioritization

LN Alegre, ALC Bazzan, DM Roijers, A Nowé… - arXiv preprint arXiv …, 2023 - arxiv.org
Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision
problems where agents may have different preferences over (possibly conflicting) reward …

[PDF][PDF] Distributional monte carlo tree search for risk-aware and multi-objective reinforcement learning

CF Hayes, M Reymond, DM Roijers, E Howley… - Proceedings of the 20th …, 2021 - ifaamas.org
In many risk-aware and multi-objective reinforcement learning settings, the utility of the user
is derived from the single execution of a policy. In these settings, making decisions based on …

[HTML][HTML] Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning

CF Hayes, M Reymond, DM Roijers, E Howley… - Autonomous Agents and …, 2023 - Springer
In many risk-aware and multi-objective reinforcement learning settings, the utility of the user
is derived from a single execution of a policy. In these settings, making decisions based on …

[HTML][HTML] Expected scalarised returns dominance: A new solution concept for multi-objective decision making

CF Hayes, T Verstraeten, DM Roijers, E Howley… - Neural Computing and …, 2022 - Springer
In many real-world scenarios, the utility of a user is derived from a single execution of a
policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the …

[PDF][PDF] Decision-theoretic planning for the expected scalarised returns

CF Hayes, DM Roijers, E Howley… - Proceedings of the 21st …, 2022 - ifmas.csc.liv.ac.uk
In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …

Multi-objective coordination graphs for the expected scalarised returns with generative flow models

CF Hayes, T Verstraeten, DM Roijers, E Howley… - arXiv preprint arXiv …, 2022 - arxiv.org
Many real-world problems contain multiple objectives and agents, where a trade-off exists
between objectives. Key to solving such problems is to exploit sparse dependency …

[PDF][PDF] Multi-objective distributional value iteration

CF Hayes, DM Roijers, E Howley… - Adaptive and Learning …, 2022 - researchgate.net
In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …

[PDF][PDF] Multi-objective decision making for trustworthy ai

P Mannion, F Heintz… - … of the Multi …, 2021 - modem2021.cs.universityofgalway …
If widespread deployment of AI systems is to be accepted by society in the future, it is crucial
that such systems are trustworthy. Trustworthiness for autonomous systems has a number of …

Hybrid Surrogate Assisted Evolutionary Multiobjective Reinforcement Learning for Continuous Robot Control

A Mazumdar, V Kyrki - International Conference on the Applications of …, 2024 - Springer
Many real world reinforcement learning (RL) problems consist of multiple conflicting
objective functions that need to be optimized simultaneously. Finding these optimal policies …