DrM: Mastering visual reinforcement learning through dormant ratio minimization

Y Liang, Y Sun, R Zheng, X Liu, B Eysenbach… - arXiv preprint arXiv …, 2023 - arxiv.org

Deploying reinforcement learning (RL) systems requires robustness to uncertainty and
model misspecification, yet prior robust RL methods typically only study noise introduced …

Cited by 4 | Related articles | All 5 versions

[PDF] thecvf.com

UVIS: Unsupervised Video Instance Segmentation

S Huang, S Suri, K Gupta… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video instance segmentation requires classifying, segmenting, and tracking every object
across video frames. Unlike existing approaches that rely on masks, boxes, or category labels …

Cited by 1 | Related articles | All 3 versions

[PDF] arxiv.org

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan… - arXiv preprint arXiv …, 2023 - arxiv.org

Dyna-style model-based reinforcement learning contains two phases: model rollouts to
generate samples for policy learning and real environment exploration using the current policy …

Cited by 3 | Related articles | All 5 versions

[PDF] arxiv.org

Premier-TACO: Pretraining multitask representation via temporal action-driven contrastive loss

R Zheng, Y Liang, X Wang, S Ma, H Daumé III… - arXiv preprint arXiv …, 2024 - arxiv.org

We present Premier-TACO, a multitask feature representation learning approach designed
to improve few-shot policy learning efficiency in sequential decision-making tasks. Premier …

Cited by 2 | Related articles | All 2 versions

[PDF] arxiv.org
