On the prospects of incorporating large language models (llms) in automated planning and scheduling (aps)

V Pallagani, BC Muppasani, K Roy, F Fabiano… - Proceedings of the …, 2024 - ojs.aaai.org
Abstract Automated Planning and Scheduling is among the growing areas in Artificial
Intelligence (AI) where mention of LLMs has gained popularity. Based on a comprehensive …

A survey on knowledge distillation of large language models

X Xu, M Li, C Tao, T Shen, R Cheng, J Li, C Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
This survey presents an in-depth exploration of knowledge distillation (KD) techniques
within the realm of Large Language Models (LLMs), spotlighting the pivotal role of KD in …

Gpt-4v (ision) for robotics: Multimodal task planning from human demonstration

N Wake, A Kanehira, K Sasabuchi, J Takamatsu… - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce a pipeline that enhances a general-purpose Vision Language Model, GPT-4V
(ision), by integrating observations of human actions to facilitate robotic manipulation. This …

Multilingual large language model: A survey of resources, taxonomy and frontiers

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao… - arXiv preprint arXiv …, 2024 - arxiv.org
Multilingual Large Language Models are capable of using powerful Large Language
Models to handle and respond to queries in multiple languages, which achieves remarkable …

Ldb: A large language model debugger via verifying runtime execution step-by-step

L Zhong, Z Wang, J Shang - arXiv preprint arXiv:2402.16906, 2024 - arxiv.org
Large language models (LLMs) are leading significant progress in code generation. Beyond
one-pass code generation, recent works further integrate unit tests and program verifiers into …

Large language models meet nlp: A survey

L Qin, Q Chen, X Feng, Y Wu, Y Zhang, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org
While large language models (LLMs) like ChatGPT have shown impressive capabilities in
Natural Language Processing (NLP) tasks, a systematic investigation of their potential in this …

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Y Mu, J Chen, Q Zhang, S Chen, Q Yu, C Ge… - arXiv preprint arXiv …, 2024 - arxiv.org
Robotic behavior synthesis, the problem of understanding multimodal inputs and generating
precise physical control for robots, is an important part of Embodied AI. Despite successes in …

MCoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought

Q Chen, L Qin, J Zhang, Z Chen, X Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Multi-modal Chain-of-Thought (MCoT) requires models to leverage knowledge from both
textual and visual modalities for step-by-step reasoning, which gains increasing attention …

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

Y Zeng, Y Mu, L Shao - arXiv preprint arXiv:2405.07162, 2024 - arxiv.org
Learning reward functions remains the bottleneck to equip a robot with a broad repertoire of
skills. Large Language Models (LLM) contain valuable task-related knowledge that can …

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

Y Zhang, S Yang, C Bai, F Wu, X Li, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Grounding the reasoning ability of large language models (LLMs) for embodied tasks is
challenging due to the complexity of the physical world. Especially, LLM planning for multi …