[HTML][HTML] 2D and 3D object detection algorithms from images: A Survey

W Chen, Y Li, Z Tian, F Zhang - Array, 2023 - Elsevier
Object detection is a crucial branch of computer vision that aims to locate and classify
objects in images. Using deep convolutional neural networks (CNNs) as the primary …

Robustness-aware 3d object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

Exploring object-centric temporal modeling for efficient multi-view 3d object detection

S Wang, Y Liu, T Wang, Y Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we propose a long-sequence modeling framework, named StreamPETR, for
multi-view 3D object detection. Built upon the sparse query design in the PETR series, we …

Sparsebev: High-performance sparse 3d object detection from multi-camera videos

H Liu, Y Teng, T Lu, H Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Camera-based 3D object detection in BEV (Bird's Eye View) space has drawn great
attention over the past few years. Dense detectors typically follow a two-stage pipeline by …

Fb-bev: Bev representation from forward-backward view transformations

Z Li, Z Yu, W Wang, A Anandkumar… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract View Transformation Module (VTM), where transformations happen between multi-
view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera …

Unitr: A unified and efficient multi-modal transformer for bird's-eye-view representation

H Wang, H Tang, S Shi, A Li, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Jointly processing information from multiple sensors is crucial to achieving accurate and
robust perception for reliable autonomous driving systems. However, current 3D perception …

Unipad: A universal pre-training paradigm for autonomous driving

H Yang, S Zhang, D Huang, X Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the context of autonomous driving the significance of effective feature learning is widely
acknowledged. While conventional 3D self-supervised pre-training methods have shown …

Vision-centric bev perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou, Y Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Efficient deformable convnets: Rethinking dynamic and sparse operator for vision applications

Y Xiong, Z Li, Y Chen, F Wang, X Zhu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract We introduce Deformable Convolution v4 (DCNv4) a highly efficient and effective
operator designed for a broad spectrum of vision applications. DCNv4 addresses the …

Is ego status all you need for open-loop end-to-end autonomous driving?

Z Li, Z Yu, S Lan, J Li, J Kautz, T Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com
End-to-end autonomous driving recently emerged as a promising research direction to
target autonomy from a full-stack perspective. Along this line many of the latest works follow …