Pre-training contextualized world models with in-the-wild videos for reinforcement learning

J Wu, H Ma, C Deng, M Long - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised pre-training methods utilizing large and diverse datasets have achieved
tremendous success across a range of domains. Recent work has investigated such …

Position: video as the new language for real-world decision making

S Yang, JC Walker, J Parker-Holder, Y Du… - … on Machine Learning, 2024 - openreview.net
Both text and video data are abundant on the internet and support large-scale
self-supervised learning through next token or frame prediction. However, they have not been …

Unsupervised behavior extraction via random intent priors

H Hu, Y Yang, J Ye, Z Mai… - Advances in Neural …, 2023 - proceedings.neurips.cc
Reward-free data is abundant and contains rich prior knowledge of human behaviors, but it
is not well exploited by offline reinforcement learning (RL) algorithms. In this paper, we …

Learning to act without actions

D Schmidt, M Jiang - arXiv preprint arXiv:2312.10812, 2023 - arxiv.org
Pre-training large models on vast amounts of web data has proven to be an effective
approach for obtaining powerful, general models in several domains, including language …

Foundation reinforcement learning: towards embodied generalist agents with foundation prior assistance

W Ye, Y Zhang, M Wang, S Wang, X Gu, P Abbeel… - 2023 - openreview.net
Recently, people have shown that large-scale pre-training from diverse internet-scale data is
the key to building a generalist model, as witnessed in the natural language processing …

Pre-trained Visual Dynamics Representations for Efficient Policy Learning

H Luo, B Zhou, Z Lu - European Conference on Computer Vision, 2025 - Springer
Pre-training for Reinforcement Learning (RL) with purely video data is a valuable
yet challenging problem. Although in-the-wild videos are readily available and inhere a vast …

Jafar: An open-source genie reimplemention in jax

T Willi, MT Jackson, JN Foerster - First Workshop on Controllable …, 2024 - openreview.net
We introduce Jafar, an open-source Jax reimplementation of Genie, a foundational world
model. Genie was the first world model trained in an unsupervised manner on unlabelled …

Robust Visual Imitation Learning with Inverse Dynamics Representations

S Li, X Wang, R Zuo, K Sun, L Cui, J Ding… - Proceedings of the …, 2024 - ojs.aaai.org
Imitation learning (IL) has achieved considerable success in solving complex sequential
decision-making problems. However, current IL methods mainly assume that the …

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

D Kim, H Lee, K Lee, D Hwang, J Choo - arXiv preprint arXiv:2406.06037, 2024 - arxiv.org
Recently, various pre-training methods have been introduced in vision-based
Reinforcement Learning (RL). However, their generalization ability remains unclear due to …

Towards Principled Representation Learning from Videos for Reinforcement Learning

D Misra, A Saran, T Xie, A Lamb, J Langford - arXiv preprint arXiv …, 2024 - arxiv.org
We study pre-training representations for decision-making using video data, which is
abundantly available for tasks such as game agents and software testing. Even though …