$\texttt {TACO} $: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

G Xu, R Zheng, Y Liang, X Wang, Z Yuan, T Ji… - arXiv preprint arXiv …, 2023 - arxiv.org

Visual reinforcement learning (RL) has shown promise in continuous control tasks. Despite
its progress, current algorithms are still unsatisfactory in virtually every aspect of the …

被引用次数：10 相关文章所有 5 个版本

[PDF] arxiv.org

Foundation reinforcement learning: towards embodied generalist agents with foundation prior assistance

W Ye, Y Zhang, M Wang, S Wang, X Gu… - arXiv preprint arXiv …, 2023 - arxiv.org

Recently, people have shown that large-scale pre-training from internet-scale data is the key
to building generalist models, as witnessed in NLP. To build embodied generalist agents …

被引用次数：7 相关文章所有 3 个版本

[PDF] thecvf.com

UVIS: Unsupervised Video Instance Segmentation

S Huang, S Suri, K Gupta… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video instance segmentation requires classifying segmenting and tracking every object
across video frames. Unlike existing approaches that rely on masks boxes or category labels …

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan… - arXiv preprint arXiv …, 2023 - arxiv.org

Dyna-style model-based reinforcement learning contains two phases: model rollouts to
generate sample for policy learning and real environment exploration using current policy …

被引用次数：3 相关文章所有 5 个版本

[PDF] arxiv.org

Diffusion Reward: Learning Rewards via Conditional Video Diffusion

T Huang, G Jiang, Y Ze, H Xu - arXiv preprint arXiv:2312.14134, 2023 - arxiv.org

Learning rewards from expert videos offers an affordable and effective solution to specify the
intended behaviors for reinforcement learning tasks. In this work, we propose Diffusion …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Premier-taco: Pretraining multitask representation via temporal action-driven contrastive loss

R Zheng, Y Liang, X Wang, S Ma, H Daumé III… - arXiv preprint arXiv …, 2024 - arxiv.org

We present Premier-TACO, a multitask feature representation learning approach designed
to improve few-shot policy learning efficiency in sequential decision-making tasks. Premier …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

高级搜索

QQ 群