Adaptive rollout length for model-based rl using model-free deep rl

A Bhatia, PS Thomas, S Zilberstein - arXiv preprint arXiv:2206.02380, 2022 - arxiv.org
Model-based reinforcement learning promises to learn an optimal policy from fewer
interactions with the environment compared to model-free reinforcement learning by …

A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

Deep reinforcement learning with adaptive update target combination

Z Xu, L Cao, X Chen - The Computer Journal, 2020 - academic.oup.com
Simple and efficient exploration remains a core challenge in deep reinforcement learning.
While many exploration methods can be applied to high-dimensional tasks, these methods …

[图书][B] A Transfer Learning Framework for Human-Centric Deep Reinforcement Learning with Reward Engineering

MS Ausin - 2021 - search.proquest.com
Abstract In recent years, Reinforcement Learning (RL), and especially Deep RL (DRL), have
shown outstanding performance in video games from Atari, Mario, to StarCraft. Most of the …

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2024 - proceedings.neurips.cc
Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

Deep reinforcement learning: a survey

H Wang, N Liu, Y Zhang, D Feng, F Huang, D Li… - Frontiers of Information …, 2020 - Springer
Deep reinforcement learning (RL) has become one of the most popular topics in artificial
intelligence research. It has been widely used in various fields, such as end-to-end control …

Hypothesis-driven skill discovery for hierarchical deep reinforcement learning

C Chuck, S Chockchowwat… - 2020 IEEE/RSJ …, 2020 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) is capable of learning high-performing policies on a
variety of complex high-dimensional tasks, ranging from video games to robotic …

Offworld gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research

A Kumar, T Buckley, JB Lanier, Q Wang… - arXiv preprint arXiv …, 2019 - arxiv.org
Success stories of applied machine learning can be traced back to the datasets and
environments that were put forward as challenges for the community. The challenge that the …

An OCBA-Based Method for Efficient Sample Collection in Reinforcement Learning

K Li, X Jin, QS Jia, D Ren, H Xia - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
This work focuses on the sample collection in reinforcement learning (RL), where the
interaction with the environment is typically time-consuming and extravagantly expensive. In …

Bayesian curiosity for efficient exploration in reinforcement learning

T Blau, L Ott, F Ramos - arXiv preprint arXiv:1911.08701, 2019 - arxiv.org
Balancing exploration and exploitation is a fundamental part of reinforcement learning, yet
most state-of-the-art algorithms use a naive exploration protocol like $\epsilon $-greedy …