3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

[HTML][HTML] 2D and 3D object detection algorithms from images: A Survey

W Chen, Y Li, Z Tian, F Zhang - Array, 2023 - Elsevier
Object detection is a crucial branch of computer vision that aims to locate and classify
objects in images. Using deep convolutional neural networks (CNNs) as the primary …

Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision

C Yang, Y Chen, H Tian, C Tao, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel bird's-eye-view (BEV) detector with perspective supervision, which
converges faster and better suits modern image backbones. Existing state-of-the-art BEV …

Time will tell: New outlooks and a baseline for temporal multi-view 3d object detection

J Park, C Xu, S Yang, K Keutzer, KM Kitani… - The Eleventh …, 2022 - openreview.net
While recent camera-only 3D detection methods leverage multiple timesteps, the limited
history they use significantly hampers the extent to which temporal fusion can improve object …

Fb-bev: Bev representation from forward-backward view transformations

Z Li, Z Yu, W Wang, A Anandkumar… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract View Transformation Module (VTM), where transformations happen between multi-
view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera …

Vision-centric bev perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou, Y Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Bevheight: A robust framework for vision-based roadside 3d object detection

L Yang, K Yu, T Tang, J Li, K Yuan… - Proceedings of the …, 2023 - openaccess.thecvf.com
While most recent autonomous driving system focuses on developing perception methods
on ego-vehicle sensors, people tend to overlook an alternative approach to leverage …

Temporal enhanced training of multi-view 3d object detector via historical object prediction

Z Zong, D Jiang, G Song, Z Xue… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose a new paradigm, named Historical Object Prediction (HoP) for
multi-view 3D detection to leverage temporal information more effectively. The HoP …

Exploring recurrent long-term temporal fusion for multi-view 3d perception

C Han, J Yang, J Sun, Z Ge, R Dong… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
Long-term temporal fusion is a crucial but often overlooked technique in camera-based
Bird's-Eye-View (BEV) 3D perception. Existing methods are mostly in a parallel manner …

Matrixvt: Efficient multi-camera to bev transformation for 3d perception

H Zhou, Z Ge, Z Li, X Zhang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
This paper proposes an efficient multi-camera to Bird's-Eye-View (BEV) view transformation
method for 3D perception, dubbed MatrixVT. Existing view transformers either suffer from …