Long-horizon visual planning with goal-conditioned hierarchical predictors

T Silver, S Dan, K Srinivas, JB Tenenbaum… - Proceedings of the …, 2024 - ojs.aaai.org

Recent work has considered whether large language models (LLMs) can function as
planners: given a task, generate a plan. We investigate whether LLMs can serve as …

被引用次数：56 相关文章所有 6 个版本

[PDF] mlr.press

Goal-conditioned reinforcement learning with imagined subgoals

E Chane-Sane, C Schmid… - … conference on machine …, 2021 - proceedings.mlr.press

Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it
often struggles to solve tasks that require more temporally extended reasoning. In this work …

被引用次数：124 相关文章所有 7 个版本

[PDF] mlr.press

Latent plans for task-agnostic offline reinforcement learning

E Rosete-Beas, O Mees, G Kalweit… - … on Robot Learning, 2023 - proceedings.mlr.press

Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …

被引用次数：48 相关文章所有 6 个版本

[PDF] arxiv.org

Mindstorms in natural language-based societies of mind

M Zhuge, H Liu, F Faccio, DR Ashley… - arXiv preprint arXiv …, 2023 - arxiv.org

Both Minsky's" society of mind" and Schmidhuber's" learning to think" inspire diverse
societies of large multimodal neural networks (NNs) that solve problems by interviewing …

被引用次数：40 相关文章所有 7 个版本

[PDF] arxiv.org

Hierarchical planning for long-horizon manipulation with geometric and symbolic scene graphs

Y Zhu, J Tremblay, S Birchfield… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org

We present a visually grounded hierarchical planning algorithm for long-horizon
manipulation tasks. Our algorithm offers a joint framework of neuro-symbolic task planning …

被引用次数：100 相关文章所有 4 个版本

[PDF] arxiv.org

Bits: Bi-level imitation for traffic simulation

D Xu, Y Chen, B Ivanovic… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Simulation is the key to scaling up validation and verification for robotic systems such as
autonomous vehicles. Despite advances in high-fidelity physics and sensor simulation, a …

被引用次数：45 相关文章所有 3 个版本

[PDF] mlr.press

World model as a graph: Learning latent landmarks for planning

L Zhang, G Yang, BC Stadie - International conference on …, 2021 - proceedings.mlr.press

Planning, the ability to analyze the structure of a problem in the large and decompose it into
interrelated subproblems, is a hallmark of human intelligence. While deep reinforcement …

被引用次数：74 相关文章所有 7 个版本

[PDF] neurips.cc

Clockwork variational autoencoders

V Saxena, J Ba, D Hafner - Advances in Neural Information …, 2021 - proceedings.neurips.cc

Deep learning has enabled algorithms to generate realistic images. However, accurately
predicting long video sequences requires understanding long-term dependencies and …

被引用次数：61 相关文章所有 7 个版本

[PDF] mlr.press

Learning task informed abstractions

X Fu, G Yang, P Agrawal… - … Conference on Machine …, 2021 - proceedings.mlr.press

Current model-based reinforcement learning methods struggle when operating from
complex visual scenes due to their inability to prioritize task-relevant features. To mitigate …

被引用次数：58 相关文章所有 9 个版本

[PDF] mlr.press

Generalization with lossy affordances: Leveraging broad offline data for learning visuomotor tasks

K Fang, P Yin, A Nair, HR Walke… - … on Robot Learning, 2023 - proceedings.mlr.press

The use of broad datasets has proven to be crucial for generalization for a wide range of
fields. However, how to effectively make use of diverse multi-task data for novel downstream …

被引用次数：17 相关文章所有 4 个版本

高级搜索

QQ 群