Drivevlm: The convergence of autonomous driving and large vision-language models

X Tian, J Gu, B Li, Y Liu, C Hu, Y Wang, K Zhan… - arXiv preprint arXiv …, 2024 - arxiv.org
A primary hurdle of autonomous driving in urban environments is understanding complex
and long-tail scenarios, such as challenging road conditions and delicate human behaviors …

Empowering autonomous driving with large language models: A safety perspective

Y Wang, R Jiao, C Lang, SS Zhan, C Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
Autonomous Driving (AD) faces crucial hurdles for commercial launch, notably in the form of
diminished public trust and safety concerns from long-tail unforeseen driving scenarios. This …

Forging vision foundation models for autonomous driving: Challenges, methodologies, and opportunities

X Yan, H Zhang, Y Cai, J Guo, W Qiu, B Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
The rise of large foundation models, trained on extensive datasets, is revolutionizing the
field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by …

Rag-driver: Generalisable driving explanations with retrieval-augmented in-context learning in multi-modal large language model

J Yuan, S Sun, D Omeiza, B Zhao, P Newman… - arXiv preprint arXiv …, 2024 - arxiv.org
Robots powered by'blackbox'models need to provide human-understandable explanations
which we can trust. Hence, explainability plays a critical role in trustworthy autonomous …

LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models

M Peng, X Guo, X Chen, M Zhu, K Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
To ensure safe driving in dynamic environments, autonomous vehicles should possess the
capability to accurately predict the lane change intentions of surrounding vehicles in …

A survey on the memory mechanism of large language model based agents

Z Zhang, X Bo, C Ma, R Li, X Chen, Q Dai, J Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language model (LLM) based agents have recently attracted much attention from the
research and industry communities. Compared with original LLMs, LLM-based agents are …

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

X Cao, T Zhou, Y Ma, W Ye, C Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-language generative AI has demonstrated remarkable promise for empowering cross-
modal scene understanding of autonomous driving and high-definition (HD) map systems …

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

S Luo, W Chen, W Tian, R Liu, L Hou… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Foundation models have indeed made a profound impact on various fields, emerging as
pivotal components that significantly shape the capabilities of intelligent systems. In the …

Exploring the Potential of Large Language Models in Self-adaptive Systems

J Li, M Zhang, N Li, D Weyns, Z Jin, K Tei - Proceedings of the 19th …, 2024 - dl.acm.org
Large Language Models (LLMs), with their abilities in knowledge acquisition and reasoning,
can potentially enhance the various aspects of Self-adaptive Systems (SAS). Yet, the …

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving

Y Chen, Z Ding, Z Wang, Y Wang, L Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite real-time planners exhibiting remarkable performance in autonomous driving, the
growing exploration of Large Language Models (LLMs) has opened avenues for enhancing …