Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

M Azarafza, M Nayyeri, C Steinmetz, S Staab… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have garnered significant attention for their ability to
understand text and images, generate human-like text, and perform complex reasoning …

Towards robust multi-modal reasoning via model selection

X Liu, R Li, W Ji, T Lin - arXiv preprint arXiv:2310.08446, 2023 - arxiv.org
The reasoning capabilities of LLM (Large Language Model) are widely acknowledged in
recent research, inspiring studies on tool learning and autonomous agents. LLM serves as …

Large language models for autonomous driving: Real-world experiments

C Cui, Z Yang, Y Zhou, Y Ma, J Lu, Z Wang - arXiv preprint arXiv …, 2023 - arxiv.org
Autonomous driving systems are increasingly popular in today's technological landscape,
where vehicles with partial automation have already been widely available on the market …

Drivellm: Charting the path toward full autonomous driving with large language models

Y Cui, S Huang, J Zhong, Z Liu, Y Wang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Human drivers instinctively reason with commonsense knowledge to predict hazards in
unfamiliar scenarios and to understand the intentions of other road users. However, this …

Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective

M Chen, Y Cao, Y Zhang, C Lu - arXiv preprint arXiv:2403.18346, 2024 - arxiv.org
Recent advancements in Large Language Models (LLMs) have facilitated the development
of Multimodal LLMs (MLLMs). Despite their impressive capabilities, MLLMs often suffer from …

ChatGPT is on the horizon: could a large language model be suitable for intelligent traffic safety research and applications?

O Zheng, M Abdel-Aty, D Wang, Z Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
ChatGPT embarks on a new era of artificial intelligence and will revolutionize the way we
approach intelligent traffic safety systems. This paper begins with a brief introduction about …

NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models

L Fan, W Hua, X Li, K Zhu, M Jin, L Li, H Ling… - arXiv preprint arXiv …, 2024 - arxiv.org
Understanding the reasoning capabilities of Multimodal Large Language Models (MLLMs) is
an important area of research. In this study, we introduce a dynamic benchmark …

Chef: A comprehensive evaluation framework for standardized assessment of multimodal large language models

Z Shi, Z Wang, H Fan, Z Yin, L Sheng, Y Qiao… - arXiv preprint arXiv …, 2023 - arxiv.org
Multimodal Large Language Models (MLLMs) have shown impressive abilities in interacting
with visual content with myriad potential downstream tasks. However, even though a list of …

LLM-based Operating Systems for Automated Vehicles: A New Perspective

J Ge, C Chang, J Zhang, L Li, X Na… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
The deployment of large language models (LLMs) brings challenges to intelligent systems
because its capability of integrating large-scale training data facilitates contextual reasoning …

Lampilot: An open benchmark dataset for autonomous driving with language model programs

Y Ma, C Cui, X Cao, W Ye, P Liu, J Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Autonomous driving (AD) has made significant strides in recent years. However existing
frameworks struggle to interpret and execute spontaneous user instructions such as" …