Generalized planning in pddl domains with pretrained large language models

T Silver, S Dan, K Srinivas, JB Tenenbaum… - Proceedings of the …, 2024 - ojs.aaai.org
Recent work has considered whether large language models (LLMs) can function as
planners: given a task, generate a plan. We investigate whether LLMs can serve as …

Goal-conditioned reinforcement learning with imagined subgoals

E Chane-Sane, C Schmid… - … conference on machine …, 2021 - proceedings.mlr.press
Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it
often struggles to solve tasks that require more temporally extended reasoning. In this work …

Latent plans for task-agnostic offline reinforcement learning

E Rosete-Beas, O Mees, G Kalweit… - … on Robot Learning, 2023 - proceedings.mlr.press
Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …

Mindstorms in natural language-based societies of mind

M Zhuge, H Liu, F Faccio, DR Ashley… - arXiv preprint arXiv …, 2023 - arxiv.org
Both Minsky's" society of mind" and Schmidhuber's" learning to think" inspire diverse
societies of large multimodal neural networks (NNs) that solve problems by interviewing …

Hierarchical planning for long-horizon manipulation with geometric and symbolic scene graphs

Y Zhu, J Tremblay, S Birchfield… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
We present a visually grounded hierarchical planning algorithm for long-horizon
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …

Bits: Bi-level imitation for traffic simulation

D Xu, Y Chen, B Ivanovic… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Simulation is the key to scaling up validation and verification for robotic systems such as
autonomous vehicles. Despite advances in high-fidelity physics and sensor simulation, a …

World model as a graph: Learning latent landmarks for planning

L Zhang, G Yang, BC Stadie - International conference on …, 2021 - proceedings.mlr.press
Planning, the ability to analyze the structure of a problem in the large and decompose it into
interrelated subproblems, is a hallmark of human intelligence. While deep reinforcement …

Clockwork variational autoencoders

V Saxena, J Ba, D Hafner - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Deep learning has enabled algorithms to generate realistic images. However, accurately
predicting long video sequences requires understanding long-term dependencies and …

Learning task informed abstractions

X Fu, G Yang, P Agrawal… - … Conference on Machine …, 2021 - proceedings.mlr.press
Current model-based reinforcement learning methods struggle when operating from
complex visual scenes due to their inability to prioritize task-relevant features. To mitigate …

Generalization with lossy affordances: Leveraging broad offline data for learning visuomotor tasks

K Fang, P Yin, A Nair, HR Walke… - … on Robot Learning, 2023 - proceedings.mlr.press
The use of broad datasets has proven to be crucial for generalization for a wide range of
fields. However, how to effectively make use of diverse multi-task data for novel downstream …