3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Voxelnext: Fully sparse voxelnet for 3d object detection and tracking

Y Chen, J Liu, X Zhang, X Qi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract 3D object detectors usually rely on hand-crafted proxies, eg, anchors or centers,
and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be …

Fully convolutional one-stage 3d object detection on lidar range images

Z Tian, X Chu, X Wang, X Wei… - Advances in Neural …, 2022 - proceedings.neurips.cc
We present a simple yet effective fully convolutional one-stage 3D object detector for LiDAR
point clouds of autonomous driving scenes, termed FCOS-LiDAR. Unlike the dominant …

PillarNeXt: Rethinking network designs for 3D object detection in LiDAR point clouds

J Li, C Luo, X Yang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
In order to deal with the sparse and unstructured raw point clouds, most LiDAR based 3D
object detection research focuses on designing dedicated local point aggregators for fine …

GD-MAE: generative decoder for MAE pre-training on lidar point clouds

H Yang, T He, J Liu, H Chen, B Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the tremendous progress of Masked Autoencoders (MAE) in developing vision tasks
such as image and video, exploring MAE in large-scale 3D point clouds remains …

Geomae: Masked geometric target prediction for self-supervised point cloud pre-training

X Tian, H Ran, Y Wang, H Zhao - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper tries to address a fundamental question in point cloud self-supervised learning:
what is a good signal we should leverage to learn features from point clouds without …

Ad-pt: Autonomous driving pre-training with large-scale point cloud dataset

J Yuan, B Zhang, X Yan, B Shi… - Advances in Neural …, 2024 - proceedings.neurips.cc
It is a long-term vision for Autonomous Driving (AD) community that the perception models
can learn from a large-scale point cloud dataset, to obtain unified representations that can …

Link: Linear kernel for lidar-based 3d perception

T Lu, X Ding, H Liu, G Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Extending the success of 2D Large Kernel to 3D perception is challenging due to: 1. the
cubically-increasing overhead in processing 3D data; 2. the optimization difficulties from …

Pvt-ssd: Single-stage 3d object detector with point-voxel transformer

H Yang, W Wang, M Chen, B Lin, T He… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent Transformer-based 3D object detectors learn point cloud features either from point-
or voxel-based representations. However, the former requires time-consuming sampling …

Largekernel3d: Scaling up kernels in 3d sparse cnns

Y Chen, J Liu, X Zhang, X Qi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recent advance in 2D CNNs has revealed that large kernels are important. However, when
directly applying large convolutional kernels in 3D CNNs, severe difficulties are met, where …