Controllability-aware unsupervised skill discovery

S Park, K Lee, Y Lee, P Abbeel - arXiv preprint arXiv:2302.05103, 2023 - arxiv.org
One of the key capabilities of intelligent agents is the ability to discover useful skills without
external supervision. However, the current unsupervised skill discovery methods are often …

Metra: Scalable unsupervised rl with metric-aware abstraction

S Park, O Rybkin, S Levine - arXiv preprint arXiv:2310.08887, 2023 - arxiv.org
Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc
In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

Neuroevolution is a competitive alternative to reinforcement learning for skill discovery

F Chalumeau, R Boige, B Lim, V Macé, M Allard… - arXiv preprint arXiv …, 2022 - arxiv.org
Deep Reinforcement Learning (RL) has emerged as a powerful paradigm for training neural
policies to solve complex control tasks. However, these policies tend to be overfit to the …

Planning goals for exploration

ES Hu, R Chang, O Rybkin, D Jayaraman - arXiv preprint arXiv …, 2023 - arxiv.org
Dropped into an unknown environment, what should an agent do to quickly learn about the
environment and how to accomplish diverse tasks within it? We address this question within …

Unsupervised skill discovery via recurrent skill training

Z Jiang, J Gao, J Chen - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Being able to discover diverse useful skills without external reward functions is beneficial in
reinforcement learning research. Previous unsupervised skill discovery approaches mainly …

Pretraining in deep reinforcement learning: A survey

Z Xie, Z Lin, J Li, S Li, D Ye - arXiv preprint arXiv:2211.03959, 2022 - arxiv.org
The past few years have seen rapid progress in combining reinforcement learning (RL) with
deep learning. Various breakthroughs ranging from games to robotics have spurred the …

A unified algorithm framework for unsupervised discovery of skills based on determinantal point process

J Chen, V Aggarwal, T Lan - Advances in Neural …, 2024 - proceedings.neurips.cc
Learning rich skills under the option framework without supervision of external rewards is at
the frontier of reinforcement learning research. Existing works mainly fall into two distinctive …

Temporal abstractions-augmented temporally contrastive learning: An alternative to the Laplacian in RL

A Erraqabi, MC Machado, M Zhao… - Uncertainty in …, 2022 - proceedings.mlr.press
In reinforcement learning, the graph Laplacian has proved to be a valuable tool in the task-
agnostic setting, with applications ranging from skill discovery to reward shaping. Recently …

Reference RL: Reinforcement learning with reference mechanism and its application in traffic signal control

Y Lu, A Hegyi, AM Salomons, H Wang - Information Sciences, 2025 - Elsevier
This paper addresses the challenges of deploying reinforcement learning (RL) models for
traffic signal control (TSC) in real-world environments. Real-world training can prevent …