reinforcement learning algorithms- 学术资源搜索

A review on reinforcement learning algorithms and applications in supply chain management

B Rolf, I Jackson, M Müller, S Lang… - … Journal of Production …, 2023 - Taylor & Francis

… supply chain drivers, algorithms, data sources, … learning algorithm is still the most popular
one. Second, inventory management is the most common application of reinforcement learning …

被引用次数：93 相关文章所有 7 个版本

[PDF] google.com

Fundamental design principles for reinforcement learning algorithms

AM Devraj, A Bušić, S Meyn - Handbook of Reinforcement Learning and …, 2021 - Springer

… This chapter contains a survey of these concepts, along with a survey of the new class of Zap
reinforcement learning algorithms introduced by the authors. These algorithms can achieve …

被引用次数：20 相关文章所有 7 个版本

[PDF] ucl.ac.uk

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai… - Science, 2018 - science.org

… using the same algorithm and network … reinforcement learning algorithm can learn, tabula
rasa—without domain-specific human knowledge or data, as evidenced by the same algorithm …

被引用次数：4418 相关文章所有 20 个版本

A stochastic reinforcement learning algorithm for learning real-valued functions

V Gullapalli - Neural networks, 1990 - Elsevier

… Abstract--Most of the research in reinforcement learning has … a stochastic reinforcement
learning algorithm.~br learning[… Learning takes place bv using our algorithm to arlfl,st the two …

被引用次数：418 相关文章所有 6 个版本

[PDF] neurips.cc

Reinforcement learning algorithm for partially observable Markov decision problems

T Jaakkola, S Singh, M Jordan - Advances in neural …, 1994 - proceedings.neurips.cc

… the algorithms nor the analyses continue to be usable. We propose and analyze a new
learning algorithm to solve a certain class of non-Markov decision problems. Our algorithm …

被引用次数：621 相关文章所有 14 个版本

[PDF] academia.edu

[PDF][PDF] Reinforcement learning

RS Sutton, AG Barto - Journal of Cognitive Neuroscience, 1999 - academia.edu

… adjusting the time steps in these discrete-time algorithms to the scale of the task. But is this
… that the algorithm could not accomplish by itself? If reinforcement learning algorithms cannot …

被引用次数：275 相关文章所有 5 个版本

[PDF] researchgate.net

Reinforcement learning algorithms for solving classification problems

MA Wiering, H Van Hasselt… - … and Reinforcement …, 2011 - ieeexplore.ieee.org

… is quite different from supervised learning where an input is … that uses reinforcement learning
algorithms to solve … leads to competitive supervised learning algorithms, and what possible …

被引用次数：82 相关文章所有 9 个版本

[PDF] arxiv.org

Mastering chess and shogi by self-play with a general reinforcement learning algorithm

D Silver, T Hubert, J Schrittwieser, I Antonoglou… - arXiv preprint arXiv …, 2017 - arxiv.org

… In this section we discuss some notable prior work on reinforcement learning in computer
chess. NeuroChess (31) evaluated positions by a neural network that used 175 handcrafted …

被引用次数：2153 相关文章所有 18 个版本

[PDF] neurips.cc

Meta-gradient reinforcement learning

Z Xu, HP van Hasselt, D Silver - Advances in neural …, 2018 - proceedings.neurips.cc

… We derive a practical gradient-based meta-learning algorithm and show that this can
significantly improve performance on large-scale deep reinforcement learning applications. …

被引用次数：349 相关文章所有 7 个版本

Reinforcement learning: an introduction

S Thrun, ML Littman - AI Magazine, 2000 - go.gale.com

… Trust us; even if you are familiar with reinforcement learning, you … to Part 2 to learn
about reinforcement learning algorithms. … to start its algorithmic part with a description of Q-learning, …

被引用次数：103 相关文章

高级搜索

QQ 群