Dyetc: Dynamic electronic toll collection for traffic congestion alleviation

H Chen, B An, G Sharon, J Hanna, P Stone… - Proceedings of the …, 2018 - ojs.aaai.org
To alleviate traffic congestion in urban areas, electronic toll collection (ETC) systems are
deployed all over the world. Despite the merits, tolls are usually pre-determined and fixed …

A reinforcement learning algorithm for trading commodities

F Giorgi, S Herzel, P Pigato - Applied Stochastic Models in …, 2024 - Wiley Online Library
We propose a reinforcement learning (RL) algorithm for generating a trading strategy in a
realistic setting, that includes transaction costs and factors driving the asset dynamics. We …

[PDF][PDF] A cooking assistance system for patients with Alzheimers disease using reinforcement learning

H Chen, Y Soh - Int J Inf Technol, 2017 - intjit.org
With the rapid increase of population with Alzheimer's disease, it has been more and more
costly for the society to provide personal care to the patients. Artificial intelligence (AI) …

Continuous action-space reinforcement learning methods applied to the minimum-time swing-up of the acrobot

BD Nichols - 2015 IEEE International Conference on Systems …, 2015 - ieeexplore.ieee.org
Here I apply three reinforcement learning methods to the full, continuous action, swing-up
acrobot control benchmark problem. These include two approaches from the literature …

Improved Exploration in Reinforcement Learning Environments with Low-Discrepancy Action Selection

SW Carden, JO Lindborg, Z Utic - AppliedMath, 2022 - mdpi.com
Reinforcement learning (RL) is a subdomain of machine learning concerned with achieving
optimal behavior by interacting with an unknown and potentially stochastic environment. The …

A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space

BD Nichols - 2016 International Joint Conference on Neural …, 2016 - ieeexplore.ieee.org
In this paper I investigate methods of applying reinforcement learning to continuous state-
and action-space problems without a policy function. I compare the performance of four …

A critical point analysis of actor-critic algorithms with neural networks

M Gottwald, H Shen, K Diepold - IFAC-PapersOnLine, 2022 - Elsevier
Abstract We investigate Actor-Critic algorithms from the non-convex optimisation
perspective. For the past years, powerful Deep Reinforcement Learning algorithms, such as …

Analysing Neuro-Dynamic Programming Through Non-Convex Optimisation

M Gottwald - 2024 - mediatum.ub.tum.de
Dynamic Programming and a Neural Network-based value-function approximation
approach have demonstrated superior performance in solving sequential decision making …

[PDF][PDF] Improved Exploration in Reinforcement Learning Environments with Low-Discrepancy Action Selection. AppliedMath 2022, 2, 234–246

SW Carden, JO Lindborg, Z Utic - 2022 - academia.edu
Reinforcement learning (RL) is a subdomain of machine learning concerned with achieving
optimal behavior by interacting with an unknown and potentially stochastic environment. The …

A comparison of eligibility trace and momentum on SARSA in continuous state-and action-space

BD Nichols - 2017 9th Computer Science and Electronic …, 2017 - ieeexplore.ieee.org
Here the Newton's Method direct action selection approach to continuous action-space
reinforcement learning is extended to use an eligibility trace. This is then compared to the …