A review of deep reinforcement learning approaches for smart manufacturing in industry 4.0 and 5.0 framework

A del Real Torres, DS Andreiana, Á Ojeda Roldán… - Applied Sciences, 2022 - mdpi.com
In this review, the industry's current issues regarding intelligent manufacture are presented.
This work presents the status and the potential for the I4. 0 and I5. 0's revolutionary …

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Task-based end-to-end model learning in stochastic optimization

P Donti, B Amos, JZ Kolter - Advances in neural information …, 2017 - proceedings.neurips.cc
With the increasing popularity of machine learning techniques, it has become common to
see prediction algorithms operating within some larger process. However, the criteria by …

Multi-objective deep reinforcement learning

H Mossalam, YM Assael, DM Roijers… - arXiv preprint arXiv …, 2016 - arxiv.org
We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional multi-
objective decision problems where the relative importances of the objectives are not known …

[图书][B] Multi-objective decision making

DM Roijers, S Whiteson, R Brachman, P Stone - 2017 - Springer
Many real-world decision problems have multiple objectives. For example, when choosing a
medical treatment plan, we want to maximize the efficacy of the treatment, but also minimize …

Ordered preference elicitation strategies for supporting multi-objective decision making

LM Zintgraf, DM Roijers, S Linders, CM Jonker… - arXiv preprint arXiv …, 2018 - arxiv.org
In multi-objective decision planning and learning, much attention is paid to producing
optimal solution sets that contain an optimal policy for every possible user preference profile …

Sample-efficient multi-objective learning via generalized policy improvement prioritization

LN Alegre, ALC Bazzan, DM Roijers, A Nowé… - arXiv preprint arXiv …, 2023 - arxiv.org
Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision
problems where agents may have different preferences over (possibly conflicting) reward …

Softmax exploration strategies for multiobjective reinforcement learning

P Vamplew, R Dazeley, C Foale - Neurocomputing, 2017 - Elsevier
Despite growing interest over recent years in applying reinforcement learning to
multiobjective problems, there has been little research into the applicability and …

The impact of environmental stochasticity on value-based multiobjective reinforcement learning

P Vamplew, C Foale, R Dazeley - Neural Computing and Applications, 2022 - Springer
A common approach to address multiobjective problems using reinforcement learning
methods is to extend model-free, value-based algorithms such as Q-learning to use a vector …