A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Goal-conditioned reinforcement learning: Problems and solutions

M Liu, M Zhu, W Zhang - arXiv preprint arXiv:2201.08299, 2022 - arxiv.org
Goal-conditioned reinforcement learning (GCRL), related to a set of complex RL problems,
trains an agent to achieve different goals under particular scenarios. Compared to the …

Diffusion models for reinforcement learning: A survey

Z Zhu, H Zhao, H He, Y Zhong, S Zhang, H Guo… - arXiv preprint arXiv …, 2023 - arxiv.org
Diffusion models surpass previous generative models in sample quality and training
stability. Recent works have shown the advantages of diffusion models in improving …

Interactive policy shaping for human-robot collaboration with transparent matrix overlays

J Brawer, D Ghose, K Candon, M Qin… - Proceedings of the …, 2023 - dl.acm.org
One important aspect of effective human--robot collaborations is the ability for robots to
adapt quickly to the needs of humans. While techniques like deep reinforcement learning …

Pre-training goal-based models for sample-efficient reinforcement learning

H Yuan, Z Mu, F Xie, Z Lu - The Twelfth International Conference on …, 2024 - openreview.net
Pre-training on task-agnostic large datasets is a promising approach for enhancing the
sample efficiency of reinforcement learning (RL) in solving complex tasks. We present …

MHER: Model-based hindsight experience replay

R Yang, M Fang, L Han, Y Du, F Luo, X Li - arXiv preprint arXiv …, 2021 - arxiv.org
Solving multi-goal reinforcement learning (RL) problems with sparse rewards is generally
challenging. Existing approaches have utilized goal relabeling on collected experiences to …

Learning diverse policies in moba games via macro-goals

Y Gao, B Shi, X Du, L Wang, G Chen… - Advances in …, 2021 - proceedings.neurips.cc
Recently, many researchers have made successful progress in building the AI systems for
MOBA-game-playing with deep reinforcement learning, such as on Dota 2 and Honor of …

Efficient hierarchical reinforcement learning for mapless navigation with predictive neighbouring space scoring

Y Gao, J Wu, X Yang, Z Ji - IEEE Transactions on Automation …, 2023 - ieeexplore.ieee.org
Solving reinforcement learning (RL)-based mapless navigation tasks is challenging due to
their sparse reward and long decision horizon nature. Hierarchical reinforcement learning …

Long-Term Feature Extraction Via Frequency Prediction for Efficient Reinforcement Learning

J Wang, M Ye, Y Kuang, R Yang… - … on Pattern Analysis …, 2025 - ieeexplore.ieee.org
Sample efficiency remains a key challenge for the deployment of deep reinforcement
learning (RL) in real-world scenarios. A common approach is to learn efficient …

Invariant Representations Learning with Future Dynamics

W Hu, M He, X Chen, N Wang - Engineering Applications of Artificial …, 2024 - Elsevier
High-dimension inputs limit the sample efficiency of deep reinforcement learning, increasing
the difficulty of applying it to real-world continuous control tasks, especially in uncertain …