In many partially observable scenarios, Reinforcement Learning (RL) agents must rely on long-term memory in order to learn an optimal policy. We demonstrate that using techniques …
P Ladosz, E Ben-Iwhiwhu, J Dick, N Ketz… - … on Neural Networks …, 2021 - ieeexplore.ieee.org
In this article, we consider a subclass of partially observable Markov decision process (POMDP) problems which we termed confounding POMDPs. In these types of POMDPs …
K Ensinger, S Ziesche, B Rakitsch, M Tiemann… - Proceedings of the …, 2023 - ojs.aaai.org
Modeling an unknown dynamical system is crucial in order to predict the future behavior of the system. A standard approach is training recurrent models on measurement data. While …
In recent years, there has been a rise in complex and computationally expensive machine learning systems with many hyperparameters, such as deep convolutional neural networks …