If llm is the wizard, then code is the wand: A survey on how code empowers large language models to serve as intelligent agents

K Yang, J Liu, J Wu, C Yang, YR Fung, S Li… - arXiv preprint arXiv …, 2024 - arxiv.org
The prominent large language models (LLMs) of today differ from past language models not
only in size, but also in the fact that they are trained on a combination of natural language …

Towards knowledge-driven autonomous driving

X Li, Y Bai, P Cai, L Wen, D Fu, B Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper explores the emerging knowledge-driven autonomous driving technologies. Our
investigation highlights the limitations of current autonomous driving systems, in particular …

Rag-driver: Generalisable driving explanations with retrieval-augmented in-context learning in multi-modal large language model

J Yuan, S Sun, D Omeiza, B Zhao, P Newman… - arXiv preprint arXiv …, 2024 - arxiv.org
Robots powered by'blackbox'models need to provide human-understandable explanations
which we can trust. Hence, explainability plays a critical role in trustworthy autonomous …

A survey of neural code intelligence: Paradigms, advances and beyond

Q Sun, Z Chen, F Xu, K Cheng, C Ma, Z Yin… - arXiv preprint arXiv …, 2024 - arxiv.org
Neural Code Intelligence--leveraging deep learning to understand, generate, and optimize
code--holds immense potential for transformative impacts on the whole society. Bridging the …

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

X Cao, T Zhou, Y Ma, W Ye, C Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-language generative AI has demonstrated remarkable promise for empowering cross-
modal scene understanding of autonomous driving and high-definition (HD) map systems …

Why solving multi-agent path finding with large language model has not succeeded yet

W Chen, S Koenig, B Dilkina - arXiv preprint arXiv:2401.03630, 2024 - arxiv.org
With the explosive influence caused by the success of large language models (LLM) like
ChatGPT and GPT-4, there has been an extensive amount of recent work showing that …

Behavior generation with latent actions

S Lee, Y Wang, H Etukuru, HJ Kim… - arXiv preprint arXiv …, 2024 - arxiv.org
Generative modeling of complex behaviors from labeled datasets has been a longstanding
problem in decision making. Unlike language or image generation, decision making …

Graphad: Interaction scene graph for end-to-end autonomous driving

Y Zhang, D Qian, D Li, Y Pan, Y Chen, Z Liang… - arXiv preprint arXiv …, 2024 - arxiv.org
Modeling complicated interactions among the ego-vehicle, road agents, and map elements
has been a crucial part for safety-critical autonomous driving. Previous works on end-to-end …

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

S Luo, W Chen, W Tian, R Liu, L Hou… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Foundation models have indeed made a profound impact on various fields, emerging as
pivotal components that significantly shape the capabilities of intelligent systems. In the …

DrPlanner: Diagnosis and Repair of Motion Planners Using Large Language Models

Y Lin, C Li, M Ding, M Tomizuka, W Zhan… - arXiv preprint arXiv …, 2024 - arxiv.org
Motion planners are essential for the safe operation of automated vehicles across various
scenarios. However, no motion planning algorithm has achieved perfection in the literature …