Code as policies: Language model programs for embodied control

J Liang, W Huang, F Xia, P Xu… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Large language models (LLMs) trained on code-completion have been shown to be capable
of synthesizing simple Python programs from docstrings [1]. We find that these code-writing …

Visual language maps for robot navigation

C Huang, O Mees, A Zeng… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Grounding language to the visual observations of a navigating agent can be performed
using off-the-shelf visual-language models pretrained on Internet-scale data (eg, image …

Chain of thought imitation with procedure cloning

MS Yang, D Schuurmans, P Abbeel… - Advances in Neural …, 2022 - proceedings.neurips.cc
Imitation learning aims to extract high-performance policies from logged demonstrations of
expert behavior. It is common to frame imitation learning as a supervised learning problem …

On the role of the action space in robot manipulation learning and sim-to-real transfer

E Aljalbout, F Frank, M Karl… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
We study the choice of action space in robot manipulation learning and sim-to-real transfer.
We define metrics that assess the performance, and examine the emerging properties in the …

Where to start? Transferring simple skills to complex environments

V Vosylius, E Johns - Conference on Robot Learning, 2023 - proceedings.mlr.press
Robot learning provides a number of ways to teach robots simple skills, such as grasping.
However, these skills are usually trained in open, clutter-free environments, and therefore …

Diffusion model-augmented behavioral cloning

HC Wang, SF Chen, MH Hsu, CM Lai… - arXiv preprint arXiv …, 2023 - arxiv.org
Imitation learning addresses the challenge of learning by observing an expert's
demonstrations without access to reward signals from environments. Most existing imitation …

Imitation learning from expert video data for dissection trajectory prediction in endoscopic surgical procedure

J Li, Y Jin, Y Chen, HC Yip, M Scheppach… - … Conference on Medical …, 2023 - Springer
High-level cognitive assistance, such as predicting dissection trajectories in Endoscopic
Submucosal Dissection (ESD), can potentially support and facilitate surgical skills training …

SceneScore: Learning a Cost Function for Object Arrangement

I Kapelyukh, E Johns - arXiv preprint arXiv:2311.08530, 2023 - arxiv.org
Arranging objects correctly is a key capability for robots which unlocks a wide range of
useful tasks. A prerequisite for creating successful arrangements is the ability to evaluate the …

Multimodal time series learning of robots based on distributed and integrated modalities: Verification with a simulator and actual robots

H Ichiwara, H Ito, K Yamamoto, H Mori… - … conference on robotics …, 2023 - ieeexplore.ieee.org
We have developed an autonomous robot motion generation model based on distributed
and integrated multimodal learning. Since each modality used as a robot's senses, such as …

Redundancy-aware Action Spaces for Robot Learning

P Mazzaglia, N Backshall, X Ma, S James - arXiv preprint arXiv …, 2024 - arxiv.org
Joint space and task space control are the two dominant action modes for controlling robot
arms within the robot learning literature. Actions in joint space provide precise control over …