Discovering reinforcement learning algorithms

J Oh, M Hessel, WM Czarnecki, Z Xu… - Advances in …, 2020 - proceedings.neurips.cc
… -learning learning algorithms) in AI-GAs [7]. However, we aim to achieve generalisation
not just across tasks but also across different domains. Learning domain-invariant algorithms

[图书][B] Algorithms for reinforcement learning

C Szepesvári - 2022 - books.google.com
Reinforcement learning is of great … algorithms of reinforcement learning that build on the
powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning

A survey of reinforcement learning algorithms for dynamically varying environments

S Padakandla - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Reinforcement learningreinforcement learning techniques for tackling dynamically changing
environment contexts in a system. The focus is on a single autonomous RL agent learning

Measuring the reliability of reinforcement learning algorithms

SCY Chan, S Fishman, J Canny, A Korattikara… - arXiv preprint arXiv …, 2019 - arxiv.org
… well-known issue for reinforcement learning (RL) algorithms. This … Reinforcement Learning
algorithms vary widely in design, … certain notions that should span the gamut of RL algorithms. …

A hitchhiker's guide to statistical comparisons of reinforcement learning algorithms

C Colas, O Sigaud, PY Oudeyer - arXiv preprint arXiv:1904.06979, 2019 - arxiv.org
… guide to rigorous comparisons of reinforcement learning algorithms. After introducing the
concepts … guidelines and code to perform rigorous comparisons of RL algorithm performances. …

Benchmarking batch deep reinforcement learning algorithms

S Fujimoto, E Conti, M Ghavamzadeh… - arXiv preprint arXiv …, 2019 - arxiv.org
… and batch reinforcement learning algorithms under unified … We find that under these
conditions, many of these algorithms … Batch-Constrained Q-learning algorithm to a discrete-action …

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org
… different types of reinforcement learning algorithms and present definitions. At a high level,
all standard reinforcement learning algorithms follow the same basic learning loop: the agent …

[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier
… deep learning and reinforcement learning algorithms. Speakers contributed to provide talks
about their recent studies that can be key technologies to achieve human-level intelligence. …

Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms

S Huang, RFJ Dossa, C Ye, J Braga… - … of Machine Learning …, 2022 - jmlr.org
… In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved great suc…
Nevertheless, understanding all the implementation details of an algorithm remains difficult …

Reinforcement learning algorithm for non-stationary environments

S Padakandla, P KJ, S Bhatnagar - Applied Intelligence, 2020 - Springer
… a model-free learning algorithm to learn an approximately optimal policy. We propose the
use of Q-learning (QL) [44], a model-free iterative RL algorithm to obtain the experience tuples. …