Direct then diffuse: Incremental unsupervised skill discovery for state covering and goal reaching

S Park, K Lee, Y Lee, P Abbeel - arXiv preprint arXiv:2302.05103, 2023 - arxiv.org

One of the key capabilities of intelligent agents is the ability to discover useful skills without
external supervision. However, the current unsupervised skill discovery methods are often …

被引用次数：46 相关文章所有 7 个版本

[PDF] arxiv.org

Metra: Scalable unsupervised rl with metric-aware abstraction

S Park, O Rybkin, S Levine - arXiv preprint arXiv:2310.08887, 2023 - arxiv.org

Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

被引用次数：29 相关文章所有 4 个版本

[PDF] neurips.cc

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc

In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

被引用次数：8 相关文章所有 5 个版本

[PDF] arxiv.org

Neuroevolution is a competitive alternative to reinforcement learning for skill discovery

F Chalumeau, R Boige, B Lim, V Macé, M Allard… - arXiv preprint arXiv …, 2022 - arxiv.org

Deep Reinforcement Learning (RL) has emerged as a powerful paradigm for training neural
policies to solve complex control tasks. However, these policies tend to be overfit to the …

被引用次数：22 相关文章所有 4 个版本

[PDF] arxiv.org

Planning goals for exploration

ES Hu, R Chang, O Rybkin, D Jayaraman - arXiv preprint arXiv …, 2023 - arxiv.org

Dropped into an unknown environment, what should an agent do to quickly learn about the
environment and how to accomplish diverse tasks within it? We address this question within …

被引用次数：28 相关文章所有 4 个版本

[PDF] neurips.cc

Unsupervised skill discovery via recurrent skill training

Z Jiang, J Gao, J Chen - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Being able to discover diverse useful skills without external reward functions is beneficial in
reinforcement learning research. Previous unsupervised skill discovery approaches mainly …

被引用次数：16 相关文章所有 3 个版本

[PDF] arxiv.org

Pretraining in deep reinforcement learning: A survey

Z Xie, Z Lin, J Li, S Li, D Ye - arXiv preprint arXiv:2211.03959, 2022 - arxiv.org

The past few years have seen rapid progress in combining reinforcement learning (RL) with
deep learning. Various breakthroughs ranging from games to robotics have spurred the …

被引用次数：21 相关文章所有 2 个版本

[PDF] neurips.cc

A unified algorithm framework for unsupervised discovery of skills based on determinantal point process

J Chen, V Aggarwal, T Lan - Advances in Neural …, 2024 - proceedings.neurips.cc

Learning rich skills under the option framework without supervision of external rewards is at
the frontier of reinforcement learning research. Existing works mainly fall into two distinctive …

被引用次数：2 相关文章所有 7 个版本

[PDF] mlr.press

Temporal abstractions-augmented temporally contrastive learning: An alternative to the Laplacian in RL

A Erraqabi, MC Machado, M Zhao… - Uncertainty in …, 2022 - proceedings.mlr.press

In reinforcement learning, the graph Laplacian has proved to be a valuable tool in the task-
agnostic setting, with applications ranging from skill discovery to reward shaping. Recently …

被引用次数：7 相关文章所有 6 个版本

Reference RL: Reinforcement learning with reference mechanism and its application in traffic signal control

Y Lu, A Hegyi, AM Salomons, H Wang - Information Sciences, 2025 - Elsevier

This paper addresses the challenges of deploying reinforcement learning (RL) models for
traffic signal control (TSC) in real-world environments. Real-world training can prevent …

高级搜索

QQ 群