Accelerating deep reinforcement learning via knowledge-guided policy network

Y Yu, P Zhang, K Zhao, Y Zheng, J Hao - Autonomous Agents and Multi …, 2023 - Springer
Deep reinforcement learning has contributed to dramatic advances in many tasks, such as
playing games, controlling robots, and navigating complex environments. However, it …

[HTML][HTML] Arlo: A framework for automated reinforcement learning

M Mussi, D Lombarda, AM Metelli, F Trovò… - Expert Systems with …, 2023 - Elsevier
Abstract Automated Reinforcement Learning (AutoRL) is a relatively new area of research
that is gaining increasing attention. The objective of AutoRL consists in easing the …

Reincarnating reinforcement learning: Reusing prior computation to accelerate progress

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2022 - proceedings.neurips.cc
Learning tabula rasa, that is without any prior knowledge, is the prevalent workflow in
reinforcement learning (RL) research. However, RL systems, when applied to large-scale …

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

S Huang, Q Gallouédec, F Felten, A Raffin… - arXiv preprint arXiv …, 2024 - arxiv.org
In many Reinforcement Learning (RL) papers, learning curves are useful indicators to
measure the effectiveness of RL algorithms. However, the complete raw data of the learning …

The MineRL 2019 competition on sample efficient reinforcement learning using human priors

WH Guss, C Codel, K Hofmann, B Houghton… - arXiv preprint arXiv …, 2019 - arxiv.org
Though deep reinforcement learning has led to breakthroughs in many difficult domains,
these successes have required an ever-increasing number of samples. As state-of-the-art …

[PDF][PDF] Exploration from demonstration for interactive reinforcement learning

K Subramanian, CL Isbell Jr… - Proceedings of the 2016 …, 2016 - umiacs.umd.edu
Reinforcement Learning (RL) has been effectively used to solve complex problems given
careful design of the problem and algorithm parameters. However standard RL approaches …

Behavior-guided actor-critic: Improving exploration via learning policy behavior representation for deep reinforcement learning

A Fayad, M Ibrahim - arXiv preprint arXiv:2104.04424, 2021 - arxiv.org
In this work, we propose Behavior-Guided Actor-Critic (BAC), an off-policy actor-critic deep
RL algorithm. BAC mathematically formulates the behavior of the policy through …

Challenges of real-world reinforcement learning

G Dulac-Arnold, D Mankowitz, T Hester - arXiv preprint arXiv:1904.12901, 2019 - arxiv.org
Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is
beginning to show some successes in real-world scenarios. However, much of the research …

[PDF][PDF] Datasets for data-driven reinforcement learning

J Fu, A Kumar, O Nachum, G Tucker… - arXiv preprint arXiv …, 2020 - ask.qcloudimg.com
The offline reinforcement learning (RL) problem, also referred to as batch RL, refers to the
setting where a policy must be learned from a dataset of previously collected data, without …

[图书][B] Mastering reinforcement learning with python: build next-generation, self-learning models using reinforcement learning techniques and best practices

E Bilgin - 2020 - books.google.com
Get hands-on experience in creating state-of-the-art reinforcement learning agents using
TensorFlow and RLlib to solve complex real-world business and industry problems with the …