Reinforcement learning model, algorithms and its application

W Qiang, Z Zhongli - 2011 International Conference on …, 2011 - ieeexplore.ieee.org
… In this paper, we firstly survey the model and theory of reinforcement learning. Then, we
roundly present the main reinforcement learning algorithms, including Sarsa, … Q-learning

[图书][B] Algorithms for reinforcement learning

C Szepesvári - 2022 - books.google.com
Reinforcement learning is of great … algorithms of reinforcement learning that build on the
powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning

Discovering reinforcement learning algorithms

J Oh, M Hessel, WM Czarnecki, Z Xu… - Advances in …, 2020 - proceedings.neurips.cc
… -learning learning algorithms) in AI-GAs [7]. However, we aim to achieve generalisation
not just across tasks but also across different domains. Learning domain-invariant algorithms

Reinforcement learning: A survey

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org
… This paper surveys the field of reinforcement learning from a … to researchers familiar with
machine learning. Both the historical … algorithms in this section address the problem of learning

Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier
… , such as learning to play video games just from pixel information, are now successfully solved
using deep reinforcement learning. … model-free RL algorithms and pathbreaking function …

A stochastic reinforcement learning algorithm for learning real-valued functions

V Gullapalli - Neural networks, 1990 - Elsevier
… Abstract--Most of the research in reinforcement learning has … a stochastic reinforcement
learning algorithm.~br learning[… Learning takes place bv using our algorithm to arlfl,st the two …

[PDF][PDF] Algorithm Selection using Reinforcement Learning.

MG Lagoudakis, ML Littman - ICML, 2000 - algos.inesc-id.pt
… the most appropriate algorithm from the set for a given problem instance. We show how a
reinforcement learning approach can be used to select the right algorithm for each instance at …

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai… - Science, 2018 - science.org
… using the same algorithm and network … reinforcement learning algorithm can learn, tabula
rasa—without domain-specific human knowledge or data, as evidenced by the same algorithm

Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org
… Temporal difference learning algorithms are fundamental for evaluating/predicting value …
Control algorithms find optimal policies. Reinforcement learning algorithms may be based on …

Deep reinforcement learning: A brief survey

K Arulkumaran, MP Deisenroth… - IEEE Signal …, 2017 - ieeexplore.ieee.org
… has similarly accelerated progress in RL, with the use of deeplearning algorithms within RL
defining the field of DRL. The aim of this survey is to cover both seminal and recent develop…