Vadv2: End-to-end vectorized autonomous driving via probabilistic planning

S Chen, B Jiang, H Gao, B Liao, Q Xu, Q Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Learning a human-like driving policy from large-scale driving demonstrations is promising,
but the uncertainty and non-deterministic nature of planning make it challenging. In this …

Nuscenes-mqa: Integrated evaluation of captions and qa for autonomous driving datasets using markup annotations

Y Inoue, Y Yada, K Tanahashi… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Visual Question Answering (VQA) is one of the most important tasks in autonomous
driving, which requires accurate recognition and complex situation evaluations. However …

Integrating big data analytics in autonomous driving: An unsupervised hierarchical reinforcement learning approach

Z Mao, Y Liu, X Qu - Transportation Research Part C: Emerging …, 2024 - Elsevier
In the realm of autonomous vehicular systems, there has been a notable increase in end-to-
end algorithms designed for complete self-navigation. Researchers are increasingly …

LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving

D Fu, W Lei, L Wen, P Cai, S Mao, M Dou, B Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
The emergence of Multimodal Large Language Models ((M) LLMs) has ushered in new
avenues in artificial intelligence, particularly for autonomous driving by offering enhanced …

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

X Cao, T Zhou, Y Ma, W Ye, C Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-language generative AI has demonstrated remarkable promise for empowering cross-
modal scene understanding of autonomous driving and high-definition (HD) map systems …

Drivecot: Integrating chain-of-thought reasoning with end-to-end driving

T Wang, E Xie, R Chu, Z Li, P Luo - arXiv preprint arXiv:2403.16996, 2024 - arxiv.org
End-to-end driving has made significant progress in recent years, demonstrating benefits
such as system simplicity and competitive driving performance under both open-loop and …

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

Y Tang, X Han, X Li, Q Yu, Y Hao, L Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large 2D vision-language models (2D-LLMs) have gained significant attention by bridging
Large Language Models (LLMs) with images using a simple projector. Inspired by their …

Feedback-Guided Autonomous Driving

J Zhang, Z Huang, A Ray… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
While behavior cloning has recently emerged as a highly successful paradigm for
autonomous driving humans rarely learn to perform complex tasks such as driving via …

ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles

J Zhang, C Xu, B Li - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Abstract We present ChatScene a Large Language Model (LLM)-based agent that
leverages the capabilities of LLMs to generate safety-critical scenarios for autonomous …

Large Language Model-based Human-Agent Collaboration for Complex Task Solving

X Feng, ZY Chen, Y Qin, Y Lin, X Chen, Z Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent developments within the research community, the integration of Large Language
Models (LLMs) in creating fully autonomous agents has garnered significant interest …