Articles added in the last year, sorted by date

M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving

D Xu, H Li, Q Wang, Z Song, L Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
125 days ago - … current end-to-end autonomous driving can continue to improve: 1) more effective
multi-modal environment perception that can better integrate data from multi-modal and multi-…

Enhancing Autonomous Driving: A Low-Cost Monocular End-to-End Framework with Multi-Task Integration and Temporal Fusion

Z Rao, Y Cai, H Wang, Y Lian, Y Zhong… - … Intelligent Vehicles, 2024 - ieeexplore.ieee.org
159 days ago - … —End to end autonomous driving system has rapidly progressed and garnered
significant attention. Recently, multimodal fusion … a fusion transformer of multi-modal sensor to …

End-to-End Video Captioning Based on Multiview Semantic Alignment for Human–Machine Fusion

S Wu, Y Gao, W Yang, H Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
183 days ago - … • We propose MVVC, an end-to-end transformer-based video captioning model,
which … and autonomous driving. Experimental results show that our method achieves SOTA. …

FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection

C Hu, H Zheng, K Li, J Xu, W Mao, M Luo, L Wang… - 2023 - openreview.net
327 days ago - … a novel end-to-end multi-modal fusion transformer-based … and residual structures
within the fusion encoding module. … on a popular autonomous driving benchmark dataset, …