Multi-modal 3d object detection in autonomous driving: A survey and taxonomy

L Wang, X Zhang, Z Song, J Bi, G Zhang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Autonomous vehicles require constant environmental perception to obtain the distribution of
obstacles to achieve safe driving. Specifically, 3D object detection is a vital functional …

Robustness-aware 3d object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

Voxelnext: Fully sparse voxelnet for 3d object detection and tracking

Y Chen, J Liu, X Zhang, X Qi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract 3D object detectors usually rely on hand-crafted proxies, eg, anchors or centers,
and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be …

Virtual sparse convolution for multimodal 3d object detection

H Wu, C Wen, S Shi, X Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Recently, virtual/pseudo-point-based 3D object detection that seamlessly fuses
RGB images and LiDAR data by depth completion has gained great attention. However …

Logonet: Towards accurate 3d object detection with local-to-global cross-modal fusion

X Li, T Ma, Y Hou, B Shi, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
LiDAR-camera fusion methods have shown impressive performance in 3D object detection.
Recent advanced multi-modal methods mainly perform global fusion, where image features …

Unipad: A universal pre-training paradigm for autonomous driving

H Yang, S Zhang, D Huang, X Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the context of autonomous driving the significance of effective feature learning is widely
acknowledged. While conventional 3D self-supervised pre-training methods have shown …

GD-MAE: generative decoder for MAE pre-training on lidar point clouds

H Yang, T He, J Liu, H Chen, B Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the tremendous progress of Masked Autoencoders (MAE) in developing vision tasks
such as image and video, exploring MAE in large-scale 3D point clouds remains …

Octr: Octree-based transformer for 3d object detection

C Zhou, Y Zhang, J Chen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
A key challenge for LiDAR-based 3D object detection is to capture sufficient features from
large scale 3D scenes especially for distant or/and occluded objects. Albeit recent efforts …

Graphalign: Enhancing accurate feature alignment by graph matching for multi-modal 3d object detection

Z Song, H Wei, L Bai, L Yang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
LiDAR and cameras are complementary sensors for 3D object detection in autonomous
driving. However, it is challenging to explore the unnatural interaction between point clouds …

Pvt-ssd: Single-stage 3d object detector with point-voxel transformer

H Yang, W Wang, M Chen, B Lin, T He… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent Transformer-based 3D object detectors learn point cloud features either from point-
or voxel-based representations. However, the former requires time-consuming sampling …