Become a proficient player with limited data through watching pure videos

J Wu, H Ma, C Deng, M Long - Advances in Neural …, 2024 - proceedings.neurips.cc

Unsupervised pre-training methods utilizing large and diverse datasets have achieved
tremendous success across a range of domains. Recent work has investigated such …

Cited by 26 · Related articles · All 7 versions

Position: video as the new language for real-world decision making

S Yang, JC Walker, J Parker-Holder, Y Du… - … on Machine Learning, 2024 - openreview.net

Both text and video data are abundant on the internet and support large-scale self-
supervised learning through next token or frame prediction. However, they have not been …

Cited by 1 · Related articles

Unsupervised behavior extraction via random intent priors

H Hu, Y Yang, J Ye, Z Mai… - Advances in Neural …, 2023 - proceedings.neurips.cc

Reward-free data is abundant and contains rich prior knowledge of human behaviors, but it
is not well exploited by offline reinforcement learning (RL) algorithms. In this paper, we …

Cited by 5 · Related articles · All 7 versions

Learning to act without actions

D Schmidt, M Jiang - arXiv preprint arXiv:2312.10812, 2023 - arxiv.org

Pre-training large models on vast amounts of web data has proven to be an effective
approach for obtaining powerful, general models in several domains, including language …

Cited by 19 · Related articles · All 4 versions

Foundation reinforcement learning: towards embodied generalist agents with foundation prior assistance

W Ye, Y Zhang, M Wang, S Wang, X Gu, P Abbeel… - 2023 - openreview.net

Recently, people have shown that large-scale pre-training from diverse internet-scale data is
the key to building a generalist model, as witnessed in the natural language processing …

Cited by 10 · Related articles · All 2 versions

Pre-trained Visual Dynamics Representations for Efficient Policy Learning

H Luo, B Zhou, Z Lu - European Conference on Computer Vision, 2025 - Springer

Pre-training for Reinforcement Learning (RL) with purely video data is a valuable
yet challenging problem. Although in-the-wild videos are readily available and inhere a vast …

Jafar: An open-source genie reimplemention in jax

T Willi, MT Jackson, JN Foerster - First Workshop on Controllable …, 2024 - openreview.net

We introduce Jafar, an open-source Jax reimplementation of Genie, a foundational world
model. Genie was the first world model trained in an unsupervised manner on unlabelled …

Cited by 1 · Related articles

Robust Visual Imitation Learning with Inverse Dynamics Representations

S Li, X Wang, R Zuo, K Sun, L Cui, J Ding… - Proceedings of the …, 2024 - ojs.aaai.org

Imitation learning (IL) has achieved considerable success in solving complex sequential
decision-making problems. However, current IL methods mainly assume that the …

Cited by 1 · Related articles · All 3 versions

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

D Kim, H Lee, K Lee, D Hwang, J Choo - arXiv preprint arXiv:2406.06037, 2024 - arxiv.org

Recently, various pre-training methods have been introduced in vision-based
Reinforcement Learning (RL). However, their generalization ability remains unclear due to …

Cited by 2 · Related articles · All 3 versions

Towards Principled Representation Learning from Videos for Reinforcement Learning

D Misra, A Saran, T Xie, A Lamb, J Langford - arXiv preprint arXiv …, 2024 - arxiv.org

We study pre-training representations for decision-making using video data, which is
abundantly available for tasks such as game agents and software testing. Even though …

Cited by 1 · Related articles · All 3 versions
