[PDF][PDF] Multi-objective reinforcement learning using sets of pareto dominating policies

K Van Moffaert, A Nowé - The Journal of Machine Learning Research, 2014 - jmlr.org
… We note that (1) Pareto Q-learning is able to learn the entire Pareto front under the usual …
, while (2) not being biased by the shape of the Pareto front. Furthermore, (3) the set evaluation …

Meta-learning for multi-objective reinforcement learning

X Chen, A Ghadirzadeh, M Björkman… - 2019 IEEE/RSJ …, 2019 - ieeexplore.ieee.org
… In contrast to earlier approaches We train a meta-policy to estimate the Pareto front implicitly
using different tasks sampled from a distribution of the preferences over the objectives, such …

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer
… from the Pareto Front, since we make the minimal assumption that utility functions in multi-objective
planning and learning belong to the class of monotonically increasing functions. …

Multi-objective reinforcement learning with continuous pareto frontier approximation

M Pirotta, S Parisi, M Restelli - Proceedings of the AAAI conference on …, 2015 - ojs.aaai.org
… In all the experiments the learning rate α was set by hand-tuning. We start considering a
multiobjective version of the standard discrete-time Linear-Quadratic Gaussian regulator (LQG) …

PMDRL: Pareto-front-based multi-objective deep reinforcement learning

F Yang, H Huang, W Shi, Y Ma, Y Feng… - Journal of Ambient …, 2023 - Springer
reinforcement learning. Firstly, we generalize the problem of multi-objective reinforcement
learning, then Pareto optimization is applied to Q-learning to select an action and estimate the …

Multi-objective reinforcement learning through continuous pareto manifold approximation

S Parisi, M Pirotta, M Restelli - Journal of Artificial Intelligence Research, 2016 - jair.org
… Subsequently, we focus our attention on sample complexity, meant as the number of rollouts
needed to approximate the Pareto front. Finally, we analyze the quality of our algorithm on …

[HTML][HTML] Exploring the pareto front of multi-objective covid-19 mitigation policies using reinforcement learning

M Reymond, CF Hayes, L Willem, R Rădulescu… - Expert Systems with …, 2024 - Elsevier
… cost, well-being), a multi-objective decision approach is warranted to obtain … multi-objective
reinforcement learning approach by building upon a state-of-the-art algorithm called Pareto

[PDF][PDF] Using multi-objective deep reinforcement learning to uncover a pareto front in multi-body trajectory design

CJ Sullivan, N Bosanac - AAS/AIAA Astrodynamics Specialist …, 2020 - researchgate.net
… A multi-objective deep reinforcement learning algorithm, designated Multi-Reward Proximal
… In this paper, MRPPO is used to uncover the Pareto front in a multi-objective optimization …

[PDF][PDF] On following pareto-optimal policies in multi-objective planning and reinforcement learning

DM Roijers, W Röpke, A Nowé… - … of the Multi-Objective …, 2021 - researchportal.vub.be
… Furthermore, we note that for stochastic policies, the Pareto Coverage set (Pareto front) is
always convex [12, 16], ie, the PCS is also the Convex Coverage Set, which is a highly …

Multi-objective reinforcement learning: Convexity, stationarity and pareto optimality

H Lu, D Herman, Y Yu - … International Conference on Learning …, 2023 - openreview.net
… On the limitations of scalarisation for multi-objective reinforcement learning of Pareto fronts.
In Wayne Wobcke and Mengjie Zhang (eds.), AI 2008: Advances in Artificial Intelligence, pp. …