A review on reinforcement learning algorithms and applications in supply chain management

B Rolf, I Jackson, M Müller, S Lang… - … Journal of Production …, 2023 - Taylor & Francis
… supply chain drivers, algorithms, data sources, … learning algorithm is still the most popular
one. Second, inventory management is the most common application of reinforcement learning

Fundamental design principles for reinforcement learning algorithms

AM Devraj, A Bušić, S Meyn - Handbook of Reinforcement Learning and …, 2021 - Springer
… This chapter contains a survey of these concepts, along with a survey of the new class of Zap
reinforcement learning algorithms introduced by the authors. These algorithms can achieve …

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai… - Science, 2018 - science.org
… using the same algorithm and network … reinforcement learning algorithm can learn, tabula
rasa—without domain-specific human knowledge or data, as evidenced by the same algorithm

A stochastic reinforcement learning algorithm for learning real-valued functions

V Gullapalli - Neural networks, 1990 - Elsevier
… Abstract--Most of the research in reinforcement learning has … a stochastic reinforcement
learning algorithm.~br learning[… Learning takes place bv using our algorithm to arlfl,st the two …

Reinforcement learning algorithm for partially observable Markov decision problems

T Jaakkola, S Singh, M Jordan - Advances in neural …, 1994 - proceedings.neurips.cc
… the algorithms nor the analyses continue to be usable. We propose and analyze a new
learning algorithm to solve a certain class of non-Markov decision problems. Our algorithm

[PDF][PDF] Reinforcement learning

RS Sutton, AG Barto - Journal of Cognitive Neuroscience, 1999 - academia.edu
… adjusting the time steps in these discrete-time algorithms to the scale of the task. But is this
… that the algorithm could not accomplish by itself? If reinforcement learning algorithms cannot …

Reinforcement learning algorithms for solving classification problems

MA Wiering, H Van Hasselt… - … and Reinforcement …, 2011 - ieeexplore.ieee.org
… is quite different from supervised learning where an input is … that uses reinforcement learning
algorithms to solve … leads to competitive supervised learning algorithms, and what possible …

Mastering chess and shogi by self-play with a general reinforcement learning algorithm

D Silver, T Hubert, J Schrittwieser, I Antonoglou… - arXiv preprint arXiv …, 2017 - arxiv.org
… In this section we discuss some notable prior work on reinforcement learning in computer
chess. NeuroChess (31) evaluated positions by a neural network that used 175 handcrafted …

Meta-gradient reinforcement learning

Z Xu, HP van Hasselt, D Silver - Advances in neural …, 2018 - proceedings.neurips.cc
… We derive a practical gradient-based meta-learning algorithm and show that this can
significantly improve performance on large-scale deep reinforcement learning applications. …

Reinforcement learning: an introduction

S Thrun, ML Littman - AI Magazine, 2000 - go.gale.com
… Trust us; even if you are familiar with reinforcement learning, you … to Part 2 to learn
about reinforcement learning algorithms. … to start its algorithmic part with a description of Q-learning, …