VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought

GH Sarch, L Jang, MJ Tarr, WW Cohen… - The Thirty-eighth …, 2024 - openreview.net
Large-scale generative language and vision-language models (LLMs and VLMs) excel in
few-shot in-context learning for decision making and instruction following. However, they …

DABI: Evaluation of Data Augmentation Methods Using Downsampling in Bilateral Control-Based Imitation Learning with Images

M Kobayashi, T Buamanee, Y Uranishi - arXiv preprint arXiv:2410.04370, 2024 - arxiv.org
Autonomous robot manipulation is a complex and continuously evolving robotics field. This
paper focuses on data augmentation methods in imitation learning. Imitation learning …

VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs

G Sarch, L Jang, MJ Tarr, WW Cohen, K Marino… - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale generative language and vision-language models excel in in-context learning
for decision making. However, they require high-quality exemplar demonstrations to be …

Language-guided Robust Navigation for Mobile Robots in Dynamically-changing Environments

C Simons, Z Liu, B Marcus, AK Roy-Chowdhury… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we develop an embodied AI system for human-in-the-loop navigation with a
wheeled mobile robot. We propose a direct yet effective method of monitoring the robot's …

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Y Dai, J Lee, N Fazeli, J Chai - arXiv preprint arXiv:2409.14674, 2024 - arxiv.org
Developing robust and correctable visuomotor policies for robotic manipulation is
challenging due to the lack of self-recovery mechanisms from failures and the limitations of …

Incremental Learning for Robot Shared Autonomy

Y Tao, G Qiao, D Ding, Z Erickson - arXiv preprint arXiv:2410.06315, 2024 - arxiv.org
Shared autonomy holds promise for improving the usability and accessibility of assistive
robotic arms, but current methods often rely on costly expert demonstrations and lack the …

Task Success Prediction and Open-Vocabulary Object Manipulation

M Kambara, K Sugiura - arXiv preprint arXiv:2412.19112, 2024 - arxiv.org
This study addresses a task designed to predict the future success or failure of open-
vocabulary object manipulation. In this task, the model is required to make predictions based …

Diff-DAgger: Uncertainty Estimation with Diffusion Policy for Robotic Manipulation

SW Lee, YL Kuo - arXiv preprint arXiv:2410.14868, 2024 - arxiv.org
Recently, diffusion policy has shown impressive results in handling multi-modal tasks in
robotic manipulation. However, it has fundamental limitations in out-of-distribution failures …

Scalable Lifelong Imitation Learning for Robot Fleets

R Hoque - 2024 - search.proquest.com
Recent breakthroughs in deep learning have revolutionized natural language processing,
computer vision, and robotics. Nevertheless, reliable robot autonomy in unstructured …

Multi-Strategy Deployment-Time Learning and Adaptation for Navigation under Uncertainty

A Paudel, X Xiao, GJ Stein - 8th Annual Conference on Robot Learning - openreview.net
We present an approach for performant point-goal navigation in unfamiliar partially-mapped
environments. When deployed, our robot runs multiple strategies for deployment-time …