Model-based reinforcement learning via imagination with derived memory

Y Mu, Y Zhuang, B Wang, G Zhu… - Advances in …, 2021 - proceedings.neurips.cc
Abstract Model-based reinforcement learning aims to improve the sample efficiency of policy
learning by modeling the dynamics of the environment. Recently, the latent dynamics model …

Mixed reinforcement learning for efficient policy optimization in stochastic environments

Y Mu, B Peng, Z Gu, SE Li, C Liu, B Nie… - … and Systems (ICCAS …, 2020 - ieeexplore.ieee.org
Reinforcement learning has the potential to control stochastic nonlinear systems in optimal
manners successfully. We propose a mixed reinforcement learning (mixed RL) algorithm by …

Multiobjective in-core fuel management optimisation for nuclear research reactors

EB Schlunz - 2016 - scholar.sun.ac.za
Abstract ENGLISH SUMMARY: The efficiency and effectiveness of fuel usage in a typical
nuclear reactor is influenced by the specific arrangement of available fuel assemblies in the …

[PDF][PDF] Research reactor in-core fuel management optimisation using the multiobjective cross-entropy method

EB Schlünz, PM Bokov… - Proceedings of the 2014 …, 2014 - researchgate.net
The in-core fuel management optimisation (ICFMO) problem has been studied for several
decades. Very little research has, however, been aimed at multiobjective optimisation …

Development of a discrete-event, stochastic multi-objective metaheuristic simulation optimisation suite for a commercial software package

T Bamporiki, J Bekker - South African Journal of Industrial …, 2018 - journals.co.za
In this paper, the authors present the development of an optimisation suite and its
implementation. This paper is part of an ongoing project that aims at developing a hybrid …

New multi-objective ranking and selection procedures for discrete stochastic simulation problems

M Yoon - 2018 - scholar.sun.ac.za
In stochastic simulation optimisation, several system designs are considered. These designs
are ranked in order and the best is selected based on one or more performance measures …

[PDF][PDF] Using a discrete-event, simulation optimisation optimiser to solve a stochastic multi-objective NP-hard problem

T Bamporiki, J Bekker, M Yoon - COMA'19, 2019 - academia.edu
This paper presents the use of the cross-entropy method for multi-objective optimisation
(MOO CEM) metaheuristic and the multi-objective moonyoung yoon (MMY) procedure in a …

The application of the cross-entropy method for multi-objective optimisation to combinatorial problems

C Hauman - 2012 - scholar.sun.ac.za
Society is continually in search of ways to optimise various objectives. When faced with
multiple and con icting objectives, humans are in need of solution techniques to enable …

[PDF][PDF] Investigating hyperheuristics for solving bi-objective simulation optimisation problems

L Nigrini - 2023 - scholar.sun.ac.za
The investigation and exploration of search and optimisation methodologies are crucial
research areas. Take for example the potential impact of an effective and computationally …

A comparative study on the value of accounting for possible relationships between decision variables when solving multi-objective problems

E Scholtz - 2014 - scholar.sun.ac.za
The cross-entropy method for multi-objective optimisation (MOO CEM) was recently
introduced by Bekker & Aldrich (2010) and Bekker (2012). Results presented by both show …