Robot utility models: General policies for zero-shot deployment in new environments

H Etukuru, N Naka, Z Hu, S Lee, J Mehu… - arXiv preprint arXiv …, 2024 - arxiv.org
Robot models, particularly those trained with large amounts of data, have recently shown a
plethora of real-world manipulation and navigation capabilities. Several independent efforts …

Diffusion policy policy optimization

AZ Ren, J Lidard, LL Ankile, A Simeonov… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework
including best practices for fine-tuning diffusion-based policies (eg Diffusion Policy) in …

Latent action pretraining from videos

S Ye, J Jang, B Jeon, S Joo, J Yang, B Peng… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Latent Action Pretraining for general Action models (LAPA), an unsupervised
method for pretraining Vision-Language-Action (VLA) models without ground-truth robot …

Dreamitate: Real-world visuomotor policy learning via video generation

J Liang, R Liu, E Ozguroglu, S Sudhakar… - arXiv preprint arXiv …, 2024 - arxiv.org
A key challenge in manipulation is learning a policy that can robustly generalize to diverse
visual environments. A promising mechanism for learning robust policies is to leverage …

Scaling proprioceptive-visual learning with heterogeneous pre-trained transformers

L Wang, X Chen, J Zhao, K He - arXiv preprint arXiv:2409.20537, 2024 - arxiv.org
One of the roadblocks for training generalist robotic models today is heterogeneity. Previous
robot learning methods often collect data to train with one specific embodiment for one task …

Deep generative models in robotics: A survey on learning from multimodal demonstrations

J Urain, A Mandlekar, Y Du, M Shafiullah, D Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Learning from Demonstrations, the field that proposes to learn robot behavior models from
data, is gaining popularity with the emergence of deep generative models. Although the …

Quest: Self-supervised skill abstractions for learning continuous control

A Mete, H Xue, A Wilcox, Y Chen, A Garg - arXiv preprint arXiv …, 2024 - arxiv.org
Generalization capabilities, or rather a lack thereof, is one of the most important unsolved
problems in the field of robot learning, and while several large scale efforts have set out to …

Mmp++: Motion manifold primitives with parametric curve models

Y Lee - IEEE Transactions on Robotics, 2024 - ieeexplore.ieee.org
Motion manifold primitives (MMP), a manifold-based approach for encoding basic motion
skills, can produce diverse trajectories, enabling the system to adapt to unseen constraints …

From imitation to refinement–residual rl for precise visual assembly

LL Ankile, A Simeonov, I Shenfeld… - … 2024 Workshop on …, 2024 - openreview.net
Recent advances in behavior cloning (BC), like action-chunking and diffusion, have led to
impressive progress. Still, imitation alone remains insufficient for tasks requiring reliable and …

MaIL: Improving Imitation Learning with Selective State Space Models

X Jia, Q Wang, A Donat, B Xing, G Li… - … Conference on Robot …, 2024 - openreview.net
This work introduces Mamba Imitation Learning (MaIL), a novel imitation learning (IL)
architecture that offers a computationally efficient alternative to state-of-the-art (SoTA) …