Benchmarks for automated commonsense reasoning: A survey

E Davis - ACM Computing Surveys, 2023 - dl.acm.org
More than one hundred benchmarks have been developed to test the commonsense
knowledge and commonsense reasoning abilities of artificial intelligence (AI) systems …

On the prospects of incorporating large language models (llms) in automated planning and scheduling (aps)

V Pallagani, BC Muppasani, K Roy, F Fabiano… - Proceedings of the …, 2024 - ojs.aaai.org
Abstract Automated Planning and Scheduling is among the growing areas in Artificial
Intelligence (AI) where mention of LLMs has gained popularity. Based on a comprehensive …

Tidybot: Personalized robot assistance with large language models

J Wu, R Antonova, A Kan, M Lepert, A Zeng, S Song… - Autonomous …, 2023 - Springer
For a robot to personalize physical assistance effectively, it must learn user preferences that
can be generally reapplied to future scenarios. In this work, we investigate personalization of …

Llm+ p: Empowering large language models with optimal planning proficiency

B Liu, Y Jiang, X Zhang, Q Liu, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable zero-shot generalization
abilities: state-of-the-art chatbots can provide plausible answers to many common questions …

Guiding pretraining in reinforcement learning with large language models

Y Du, O Watkins, Z Wang, C Colas… - International …, 2023 - proceedings.mlr.press
Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped
reward function. Intrinsically motivated exploration methods address this limitation by …

Large language models as commonsense knowledge for large-scale task planning

Z Zhao, WS Lee, D Hsu - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Large-scale task planning is a major challenge. Recent work exploits large language
models (LLMs) directly as a policy and shows surprisingly interesting results. This paper …

Task and motion planning with large language models for object rearrangement

Y Ding, X Zhang, C Paxton… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org
Multi-object rearrangement is a crucial skill for service robots, and commonsense reasoning
is frequently needed in this process. However, achieving commonsense arrangements …

Language models meet world models: Embodied experiences enhance language models

J Xiang, T Tao, Y Gu, T Shu, Z Wang… - Advances in neural …, 2024 - proceedings.neurips.cc
While large language models (LMs) have shown remarkable capabilities across numerous
tasks, they often struggle with simple reasoning and planning in physical environments …

Dall-e-bot: Introducing web-scale diffusion models to robotics

I Kapelyukh, V Vosylius, E Johns - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
We introduce the first work to explore web-scale diffusion models for robotics. DALL-E-Bot
enables a robot to rearrange objects in a scene, by first inferring a text description of those …

Retrospectives on the embodied ai workshop

M Deitke, D Batra, Y Bisk, T Campari, AX Chang… - arXiv preprint arXiv …, 2022 - arxiv.org
We present a retrospective on the state of Embodied AI research. Our analysis focuses on
13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are …