Application of Newton's Method to action selection in continuous state-and action-space reinforce...

H Chen, B An, G Sharon, J Hanna, P Stone… - Proceedings of the …, 2018 - ojs.aaai.org

To alleviate traffic congestion in urban areas, electronic toll collection (ETC) systems are
deployed all over the world. Despite the merits, tolls are usually pre-determined and fixed …

被引用次数：35 相关文章所有 13 个版本

[PDF] ssrn.com

A reinforcement learning algorithm for trading commodities

F Giorgi, S Herzel, P Pigato - Applied Stochastic Models in …, 2024 - Wiley Online Library

We propose a reinforcement learning (RL) algorithm for generating a trading strategy in a
realistic setting, that includes transaction costs and factors driving the asset dynamics. We …

被引用次数：3 相关文章所有 7 个版本

[PDF] intjit.org

[PDF][PDF] A cooking assistance system for patients with Alzheimers disease using reinforcement learning

H Chen, Y Soh - Int J Inf Technol, 2017 - intjit.org

With the rapid increase of population with Alzheimer's disease, it has been more and more
costly for the society to provide personal care to the patients. Artificial intelligence (AI) …

被引用次数：12 相关文章

[PDF] researchgate.net

Continuous action-space reinforcement learning methods applied to the minimum-time swing-up of the acrobot

BD Nichols - 2015 IEEE International Conference on Systems …, 2015 - ieeexplore.ieee.org

Here I apply three reinforcement learning methods to the full, continuous action, swing-up
acrobot control benchmark problem. These include two approaches from the literature …

被引用次数：13 相关文章所有 5 个版本

[PDF] mdpi.com

Improved Exploration in Reinforcement Learning Environments with Low-Discrepancy Action Selection

SW Carden, JO Lindborg, Z Utic - AppliedMath, 2022 - mdpi.com

Reinforcement learning (RL) is a subdomain of machine learning concerned with achieving
optimal behavior by interacting with an unknown and potentially stochastic environment. The …

被引用次数：1 相关文章所有 3 个版本

[PDF] academia.edu

A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space

BD Nichols - 2016 International Joint Conference on Neural …, 2016 - ieeexplore.ieee.org

In this paper I investigate methods of applying reinforcement learning to continuous state-
and action-space problems without a policy function. I compare the performance of four …

被引用次数：8 相关文章所有 6 个版本

A critical point analysis of actor-critic algorithms with neural networks

M Gottwald, H Shen, K Diepold - IFAC-PapersOnLine, 2022 - Elsevier

Abstract We investigate Actor-Critic algorithms from the non-convex optimisation
perspective. For the past years, powerful Deep Reinforcement Learning algorithms, such as …

被引用次数：2 相关文章所有 2 个版本

[PDF] tum.de

Analysing Neuro-Dynamic Programming Through Non-Convex Optimisation

M Gottwald - 2024 - mediatum.ub.tum.de

Dynamic Programming and a Neural Network-based value-function approximation
approach have demonstrated superior performance in solving sequential decision making …

[PDF] academia.edu

[PDF][PDF] Improved Exploration in Reinforcement Learning Environments with Low-Discrepancy Action Selection. AppliedMath 2022, 2, 234–246

SW Carden, JO Lindborg, Z Utic - 2022 - academia.edu

Reinforcement learning (RL) is a subdomain of machine learning concerned with achieving
optimal behavior by interacting with an unknown and potentially stochastic environment. The …

[PDF] mdx.ac.uk

A comparison of eligibility trace and momentum on SARSA in continuous state-and action-space

BD Nichols - 2017 9th Computer Science and Electronic …, 2017 - ieeexplore.ieee.org

Here the Newton's Method direct action selection approach to continuous action-space
reinforcement learning is extended to use an eligibility trace. This is then compared to the …

被引用次数：2 相关文章所有 5 个版本

高级搜索

QQ 群