Drm: Mastering visual reinforcement learning through dormant ratio minimization

G Xu, R Zheng, Y Liang, X Wang, Z Yuan, T Ji… - arXiv preprint arXiv …, 2023 - arxiv.org
Visual reinforcement learning (RL) has shown promise in continuous control tasks. Despite
its progress, current algorithms are still unsatisfactory in virtually every aspect of the …

Foundation reinforcement learning: towards embodied generalist agents with foundation prior assistance

W Ye, Y Zhang, M Wang, S Wang, X Gu… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, people have shown that large-scale pre-training from internet-scale data is the key
to building generalist models, as witnessed in NLP. To build embodied generalist agents …

UVIS: Unsupervised Video Instance Segmentation

S Huang, S Suri, K Gupta… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video instance segmentation requires classifying segmenting and tracking every object
across video frames. Unlike existing approaches that rely on masks boxes or category labels …

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan… - arXiv preprint arXiv …, 2023 - arxiv.org
Dyna-style model-based reinforcement learning contains two phases: model rollouts to
generate sample for policy learning and real environment exploration using current policy …

Diffusion Reward: Learning Rewards via Conditional Video Diffusion

T Huang, G Jiang, Y Ze, H Xu - arXiv preprint arXiv:2312.14134, 2023 - arxiv.org
Learning rewards from expert videos offers an affordable and effective solution to specify the
intended behaviors for reinforcement learning tasks. In this work, we propose Diffusion …

Premier-taco: Pretraining multitask representation via temporal action-driven contrastive loss

R Zheng, Y Liang, X Wang, S Ma, H Daumé III… - arXiv preprint arXiv …, 2024 - arxiv.org
We present Premier-TACO, a multitask feature representation learning approach designed
to improve few-shot policy learning efficiency in sequential decision-making tasks. Premier …

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

D Kim, H Lee, K Lee, D Hwang, J Choo - arXiv preprint arXiv:2406.06037, 2024 - arxiv.org
Recently, various pre-training methods have been introduced in vision-based
Reinforcement Learning (RL). However, their generalization ability remains unclear due to …

Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning

Z Yuan, T Wei, S Cheng, G Zhang, Y Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Can we endow visuomotor robots with generalization capabilities to operate in diverse open-
world scenarios? In this paper, we propose\textbf {Maniwhere}, a generalizable framework …

QueST: Self-Supervised Skill Abstractions for Learning Continuous Control

A Mete, H Xue, A Wilcox, Y Chen, A Garg - arXiv preprint arXiv …, 2024 - arxiv.org
Generalization capabilities, or rather a lack thereof, is one of the most important unsolved
problems in the field of robot learning, and while several large scale efforts have set out to …

iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

A Scannell, K Kujanpää, Y Zhao, M Nakhaei… - arXiv preprint arXiv …, 2024 - arxiv.org
Learning representations for reinforcement learning (RL) has shown much promise for
continuous control. We propose an efficient representation learning method using only a self …