Playfusion: Skill acquisition via diffusion from language-annotated play

L Chen, S Bahl, D Pathak - Conference on Robot Learning, 2023 - proceedings.mlr.press
Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …

Human instruction-following with deep reinforcement learning via transfer-learning from text

F Hill, S Mokra, N Wong, T Harley - arXiv preprint arXiv:2005.09382, 2020 - arxiv.org
Recent work has described neural-network-based agents that are trained with reinforcement
learning (RL) to execute language-like commands in simulated worlds, as a step towards an …

Translating natural language instructions to computer programs for robot manipulation

SG Venkatesh, R Upadrashta… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
It is highly desirable for robots that work alongside humans to be able to understand
instructions in natural language. Existing language conditioned imitation learning models …

AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents

J Cui, T Liu, N Liu, Y Yang, Y Zhu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Traditional approaches in physics-based motion generation centered around imitation
learning and reward shaping often struggle to adapt to new scenarios. To tackle this …

R3m: A universal visual representation for robot manipulation

S Nair, A Rajeswaran, V Kumar, C Finn… - arXiv preprint arXiv …, 2022 - arxiv.org
We study how visual representations pre-trained on diverse human video data can enable
data-efficient learning of downstream robotic manipulation tasks. Concretely, we pre-train a …

Object-centric instruction augmentation for robotic manipulation

J Wen, Y Zhu, M Zhu, J Li, Z Xu, Z Che, C Shen… - arXiv preprint arXiv …, 2024 - arxiv.org
Humans interpret scenes by recognizing both the identities and positions of objects in their
observations. For a robot to perform tasks such as\enquote {pick and place}, understanding …

Scaling up multi-task robotic reinforcement learning

D Kalashnikov, J Varley, Y Chebotar… - … Conference on Robot …, 2021 - openreview.net
General-purpose robotic systems must master a large repertoire of diverse skills. While
reinforcement learning provides a powerful framework for acquiring individual behaviors, the …

Vima: Robot manipulation with multimodal prompts

Y Jiang, A Gupta, Z Zhang, G Wang, Y Dou, Y Chen… - 2023 - openreview.net
Prompt-based learning has emerged as a successful paradigm in natural language
processing, where a single general-purpose language model can be instructed to perform …

Transformer adapters for robot learning

A Liang, I Singh, K Pertsch… - CoRL 2022 Workshop on …, 2022 - openreview.net
Large transformer-based architectures are capable of complex robot task planning and low-
level control. In the natural language processing (NLP) community, fine-tuning large …

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs

W Xia, D Wang, X Pang, Z Wang, B Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Generalizable articulated object manipulation is essential for home-assistant robots. Recent
efforts focus on imitation learning from demonstrations or reinforcement learning in …