MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

X Cao, T Zhou, Y Ma, W Ye, C Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-language generative AI has demonstrated remarkable promise for empowering
cross-modal scene understanding of autonomous driving and high-definition (HD) map systems …

VistaScenario: Interaction Scenario Engineering for Vehicles with Intelligent Systems for Transport Automation

C Chang, J Zhang, J Ge, Z Zhang, J Wei… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Intelligent vehicles and autonomous driving systems rely on scenario engineering for
intelligence and index (I&I), calibration and certification (C&C), and verification and …

Graphic Design with Large Multimodal Model

Y Cheng, Z Zhang, M Yang, H Nie, C Li, X Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
In the field of graphic design, automating the integration of design elements into a cohesive
multi-layered artwork not only boosts productivity but also paves the way for the …

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

S Wang, Z Yu, X Jiang, S Lan, M Shi, N Chang… - arXiv preprint arXiv …, 2024 - arxiv.org
The advances in multimodal large language models (MLLMs) have led to growing interests
in LLM-based autonomous driving agents to leverage their strong reasoning capabilities …

Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving

R Tian, B Li, X Weng, Y Chen, E Schmerling… - arXiv preprint arXiv …, 2024 - arxiv.org
The autonomous driving industry is increasingly adopting end-to-end learning from sensory
inputs to minimize human biases in system design. Traditional end-to-end driving models …

End-to-End Autonomous Driving Using Vision Language Model

T ZERON - opendrivelab.github.io
End-to-end autonomous driving has drawn tremendous attention recently. Many works focus
on using modular deep neural networks to construct the end-to-end architecture. However …