Drivegpt4: Interpretable end-to-end autonomous driving via large language model

Explainable artificial intelligence for autonomous driving: A comprehensive overview and field guide for future research directions

S Atakishiyev, M Salameh, H Yao, R Goebel - IEEE Access, 2024 - ieeexplore.ieee.org

Autonomous driving has achieved significant milestones in research and development over
the last two decades. There is increasing interest in the field as the deployment of …

被引用次数：135 相关文章所有 4 个版本

[PDF] thecvf.com

A survey on multimodal large language models for autonomous driving

C Cui, Y Ma, X Cao, W Ye, Y Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com

With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …

被引用次数：100 相关文章所有 7 个版本

[PDF] thecvf.com

Lmdrive: Closed-loop end-to-end driving with large language models

H Shao, Y Hu, L Wang, G Song… - Proceedings of the …, 2024 - openaccess.thecvf.com

Despite significant recent progress in the field of autonomous driving modern methods still
struggle and can incur serious accidents when encountering long-tail unforeseen events …

被引用次数：34 相关文章所有 4 个版本

[PDF] arxiv.org

Drivelm: Driving with graph visual question answering

C Sima, K Renz, K Chitta, L Chen, H Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

We study how vision-language models (VLMs) trained on web-scale data can be integrated
into end-to-end driving systems to boost generalization and enable interactivity with human …

被引用次数：52 相关文章所有 5 个版本

[PDF] thecvf.com

Gpt4point: A unified framework for point-language understanding and generation

Z Qi, Y Fang, Z Sun, X Wu, T Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Multimodal Large Language Models (MLLMs) have excelled in 2D image-text
comprehension and image generation but their understanding of the 3D world is notably …

被引用次数：11 相关文章所有 3 个版本

[PDF] thecvf.com

Lampilot: An open benchmark dataset for autonomous driving with language model programs

Y Ma, C Cui, X Cao, W Ye, P Liu, J Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Autonomous driving (AD) has made significant strides in recent years. However existing
frameworks struggle to interpret and execute spontaneous user instructions such as" …

被引用次数：13 相关文章所有 4 个版本

[PDF] arxiv.org

The (r) evolution of multimodal large language models: A survey

D Caffagni, F Cocchi, L Barsellotti, N Moratelli… - arXiv preprint arXiv …, 2024 - arxiv.org

Connecting text and visual modalities plays an essential role in generative intelligence. For
this reason, inspired by the success of large language models, significant research efforts …

被引用次数：7 相关文章所有 4 个版本

[PDF] arxiv.org

A language agent for autonomous driving

J Mao, J Ye, Y Qian, M Pavone, Y Wang - arXiv preprint arXiv:2311.10813, 2023 - arxiv.org

Human-level driving is an ultimate goal of autonomous driving. Conventional approaches
formulate autonomous driving as a perception-prediction-planning framework, yet their …

被引用次数：28 相关文章所有 2 个版本

[PDF] thecvf.com

Omnimedvqa: A new large-scale comprehensive evaluation benchmark for medical lvlm

Y Hu, T Li, Q Lu, W Shao, J He… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Large Vision-Language Models (LVLMs) have demonstrated remarkable
capabilities in various multimodal tasks. However their potential in the medical domain …

被引用次数：8 相关文章所有 3 个版本

[PDF] arxiv.org

Vision language models in autonomous driving and intelligent transportation systems

X Zhou, M Liu, BL Zagar, E Yurtsever… - arXiv preprint arXiv …, 2023 - arxiv.org

The applications of Vision-Language Models (VLMs) in the fields of Autonomous Driving
(AD) and Intelligent Transportation Systems (ITS) have attracted widespread attention due to …

被引用次数：27 相关文章所有 2 个版本

高级搜索

QQ 群