Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org
A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

Ase: Large-scale reusable adversarial skill embeddings for physically simulated characters

XB Peng, Y Guo, L Halper, S Levine… - ACM Transactions On …, 2022 - dl.acm.org
The incredible feats of athleticism demonstrated by humans are made possible in part by a
vast repertoire of general-purpose motor skills, acquired through years of practice and …

Interactive language: Talking to robots in real time

C Lynch, A Wahid, J Tompson, T Ding… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
We present a framework for building interactive, real-time, natural language-instructable
robots in the real world, and we open source related assets (dataset, environment …

Behavior Transformers: Cloning modes with one stone

NM Shafiullah, Z Cui… - Advances in neural …, 2022 - proceedings.neurips.cc
While behavior learning has made impressive progress in recent times, it lags behind
computer vision and natural language processing due to its inability to leverage large …

Foundation models for decision making: Problems, methods, and opportunities

S Yang, O Nachum, Y Du, J Wei, P Abbeel… - arXiv preprint arXiv …, 2023 - arxiv.org
Foundation models pretrained on diverse data at scale have demonstrated extraordinary
capabilities in a wide range of vision and language tasks. When such models are deployed …

Amp: Adversarial motion priors for stylized physics-based character control

XB Peng, Z Ma, P Abbeel, S Levine… - ACM Transactions on …, 2021 - dl.acm.org
Synthesizing graceful and life-like behaviors for physically simulated characters has been a
fundamental challenge in computer animation. Data-driven methods that leverage motion …

Contrastive learning as goal-conditioned reinforcement learning

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022 - proceedings.neurips.cc
In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …

Mobile aloha: Learning bimanual mobile manipulation with low-cost whole-body teleoperation

Z Fu, TZ Zhao, C Finn - arXiv preprint arXiv:2401.02117, 2024 - arxiv.org
Imitation learning from human demonstrations has shown impressive performance in
robotics. However, most results focus on table-top manipulation, lacking the mobility and …

Solving rubik's cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arXiv preprint arXiv …, 2019 - arxiv.org
We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …