reinforcement learning algorithms- 学术资源搜索

Discovering reinforcement learning algorithms

J Oh, M Hessel, WM Czarnecki, Z Xu… - Advances in …, 2020 - proceedings.neurips.cc

… -learning learning algorithms) in AI-GAs [7]. However, we aim to achieve generalisation
not just across tasks but also across different domains. Learning domain-invariant algorithms …

被引用次数：142 相关文章所有 8 个版本

[PDF] bookfusion.com

[图书][B] Algorithms for reinforcement learning

C Szepesvári - 2022 - books.google.com

… Reinforcement learning is of great … algorithms of reinforcement learning that build on the
powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning …

被引用次数：2103 相关文章所有 24 个版本

[PDF] arxiv.org

A survey of reinforcement learning algorithms for dynamically varying environments

S Padakandla - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

… Reinforcement learning … reinforcement learning techniques for tackling dynamically changing
environment contexts in a system. The focus is on a single autonomous RL agent learning …

被引用次数：133 相关文章所有 6 个版本

[PDF] arxiv.org

Measuring the reliability of reinforcement learning algorithms

SCY Chan, S Fishman, J Canny, A Korattikara… - arXiv preprint arXiv …, 2019 - arxiv.org

… well-known issue for reinforcement learning (RL) algorithms. This … Reinforcement Learning
algorithms vary widely in design, … certain notions that should span the gamut of RL algorithms. …

被引用次数：90 相关文章所有 3 个版本

[PDF] arxiv.org

A hitchhiker's guide to statistical comparisons of reinforcement learning algorithms

C Colas, O Sigaud, PY Oudeyer - arXiv preprint arXiv:1904.06979, 2019 - arxiv.org

… guide to rigorous comparisons of reinforcement learning algorithms. After introducing the
concepts … guidelines and code to perform rigorous comparisons of RL algorithm performances. …

被引用次数：87 相关文章所有 7 个版本

[PDF] arxiv.org

Benchmarking batch deep reinforcement learning algorithms

S Fujimoto, E Conti, M Ghavamzadeh… - arXiv preprint arXiv …, 2019 - arxiv.org

… and batch reinforcement learning algorithms under unified … We find that under these
conditions, many of these algorithms … Batch-Constrained Q-learning algorithm to a discrete-action …

被引用次数：191 相关文章所有 2 个版本

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

… different types of reinforcement learning algorithms and present definitions. At a high level,
all standard reinforcement learning algorithms follow the same basic learning loop: the agent …

被引用次数：1729 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier

… deep learning and reinforcement learning algorithms. Speakers contributed to provide talks
about their recent studies that can be key technologies to achieve human-level intelligence. …

被引用次数：226 相关文章所有 7 个版本

[PDF] jmlr.org

Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms

S Huang, RFJ Dossa, C Ye, J Braga… - … of Machine Learning …, 2022 - jmlr.org

… In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved great suc…
Nevertheless, understanding all the implementation details of an algorithm remains difficult …

被引用次数：170 相关文章所有 4 个版本

[PDF] arxiv.org

Reinforcement learning algorithm for non-stationary environments

S Padakandla, P KJ, S Bhatnagar - Applied Intelligence, 2020 - Springer

… a model-free learning algorithm to learn an approximately optimal policy. We propose the
use of Q-learning (QL) [44], a model-free iterative RL algorithm to obtain the experience tuples. …

被引用次数：136 相关文章所有 9 个版本

高级搜索

QQ 群

Discovering reinforcement learning algorithms

[图书][B] Algorithms for reinforcement learning

A survey of reinforcement learning algorithms for dynamically varying environments

Measuring the reliability of reinforcement learning algorithms

A hitchhiker's guide to statistical comparisons of reinforcement learning algorithms

Benchmarking batch deep reinforcement learning algorithms

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

[HTML][HTML] Deep learning, reinforcement learning, and world models

Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms

Reinforcement learning algorithm for non-stationary environments

相关搜索

引用