reinforcement learning algorithm- 学术资源搜索

Reinforcement learning model, algorithms and its application

W Qiang, Z Zhongli - 2011 International Conference on …, 2011 - ieeexplore.ieee.org

… In this paper, we firstly survey the model and theory of reinforcement learning. Then, we
roundly present the main reinforcement learning algorithms, including Sarsa, … Q-learning …

被引用次数：170 相关文章所有 2 个版本

[PDF] bookfusion.com

[图书][B] Algorithms for reinforcement learning

C Szepesvári - 2022 - books.google.com

… Reinforcement learning is of great … algorithms of reinforcement learning that build on the
powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning …

被引用次数：2105 相关文章所有 24 个版本

[PDF] neurips.cc

Discovering reinforcement learning algorithms

J Oh, M Hessel, WM Czarnecki, Z Xu… - Advances in …, 2020 - proceedings.neurips.cc

… -learning learning algorithms) in AI-GAs [7]. However, we aim to achieve generalisation
not just across tasks but also across different domains. Learning domain-invariant algorithms …

被引用次数：142 相关文章所有 8 个版本

[PDF] jair.org

Reinforcement learning: A survey

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org

… This paper surveys the field of reinforcement learning from a … to researchers familiar with
machine learning. Both the historical … algorithms in this section address the problem of learning …

被引用次数：11694 相关文章所有 77 个版本

相关搜索

Reinforcement learning algorithms: A brief survey

AK Shakya, G Pillai, S Chakrabarty - Expert Systems with Applications, 2023 - Elsevier

… , such as learning to play video games just from pixel information, are now successfully solved
using deep reinforcement learning. … model-free RL algorithms and pathbreaking function …

被引用次数：74 相关文章所有 2 个版本

A stochastic reinforcement learning algorithm for learning real-valued functions

V Gullapalli - Neural networks, 1990 - Elsevier

… Abstract--Most of the research in reinforcement learning has … a stochastic reinforcement
learning algorithm.~br learning[… Learning takes place bv using our algorithm to arlfl,st the two …

被引用次数：419 相关文章所有 6 个版本

[PDF] inesc-id.pt

[PDF][PDF] Algorithm Selection using Reinforcement Learning.

MG Lagoudakis, ML Littman - ICML, 2000 - algos.inesc-id.pt

… the most appropriate algorithm from the set for a given problem instance. We show how a
reinforcement learning approach can be used to select the right algorithm for each instance at …

被引用次数：198 相关文章所有 9 个版本

[PDF] ucl.ac.uk

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai… - Science, 2018 - science.org

… using the same algorithm and network … reinforcement learning algorithm can learn, tabula
rasa—without domain-specific human knowledge or data, as evidenced by the same algorithm …

被引用次数：4464 相关文章所有 20 个版本

[PDF] arxiv.org

Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org

… Temporal difference learning algorithms are fundamental for evaluating/predicting value …
Control algorithms find optimal policies. Reinforcement learning algorithms may be based on …

被引用次数：1798 相关文章所有 6 个版本

[PDF] ucl.ac.uk

Deep reinforcement learning: A brief survey

K Arulkumaran, MP Deisenroth… - IEEE Signal …, 2017 - ieeexplore.ieee.org

… has similarly accelerated progress in RL, with the use of deeplearning algorithms within RL
defining the field of DRL. The aim of this survey is to cover both seminal and recent develop…

被引用次数：3266 相关文章所有 6 个版本

高级搜索

QQ 群