Deep reinforcement learning for robotics: A survey of real-world successes

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Octo: An open-source generalist robot policy

OM Team, D Ghosh, H Walke, K Pertsch… - arXiv preprint arXiv …, 2024 - arxiv.org
Large policies pretrained on diverse robot datasets have the potential to transform robotic
learning: instead of training new policies from scratch, such generalist robot policies may be …

Rdt-1b: a diffusion foundation model for bimanual manipulation

S Liu, L Wu, B Li, H Tan, H Chen, Z Wang, K Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Bimanual manipulation is essential in robotics, yet developing foundation models is
extremely challenging due to the inherent complexity of coordinating two robot arms …

Igor: Image-goal representations are the atomic control units for foundation models in embodied ai

X Chen, J Guo, T He, C Zhang, P Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Image-GOal Representations (IGOR), aiming to learn a unified, semantically
consistent action space across human and various robots. Through this unified latent action …

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Q Li, Y Liang, Z Wang, L Luo, X Chen, M Liao… - arXiv preprint arXiv …, 2024 - arxiv.org
The advancement of large Vision-Language-Action (VLA) models has significantly improved
robotic manipulation in terms of language-guided task execution and generalization to …

The colosseum: A benchmark for evaluating generalization for robotic manipulation

W Pumacay, I Singh, J Duan, R Krishna… - arXiv preprint arXiv …, 2024 - arxiv.org
To realize effective large-scale, real-world robotic applications, we must evaluate how well
our robot policies adapt to changes in environmental conditions. Unfortunately, a majority of …

A survey on robotics with foundation models: toward embodied ai

Z Xu, K Wu, J Wen, J Li, N Liu, Z Che, J Tang - arXiv preprint arXiv …, 2024 - arxiv.org
While the exploration for embodied AI has spanned multiple decades, it remains a persistent
challenge to endow agents with human-level intelligence, including perception, learning …

Prediction with Action: Visual Policy Learning via Joint Denoising Process

Y Guo, Y Hu, J Zhang, YJ Wang, X Chen, C Lu… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion models have demonstrated remarkable capabilities in image generation tasks,
including image editing and video creation, representing a good understanding of the …

RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios

L Zheng, F Yan, F Liu, C Feng, Z Kang, L Ma - arXiv preprint arXiv …, 2024 - arxiv.org
Foundation models hold significant potential for enabling robots to perform long-horizon
general manipulation tasks. However, the simplicity of tasks and the uniformity of …

Motion planning for 7-degree-of-freedom bionic arm: Deep deterministic policy gradient algorithm based on imitation of human action

B Li, S Qiu, H Ye, Y Guo, H Wang, J Bai - Engineering Applications of …, 2025 - Elsevier
Smart bionic arms have played a great role in returning amputees to society. However, most
of the current bionic arms are radial configuration type with few degrees of freedom and …