Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Grid-centric traffic scenario perception for autonomous driving: A comprehensive review

Y Shi, K Jiang, J Li, Z Qian, J Wen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
The grid-centric perception is a crucial field for mobile robot perception and navigation.
Nonetheless, the grid-centric perception is less prevalent than object-centric perception as …

Exploring object-centric temporal modeling for efficient multi-view 3d object detection

S Wang, Y Liu, T Wang, Y Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we propose a long-sequence modeling framework, named StreamPETR, for
multi-view 3D object detection. Built upon the sparse query design in the PETR series, we …

Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision

C Yang, Y Chen, H Tian, C Tao, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel bird's-eye-view (BEV) detector with perspective supervision, which
converges faster and better suits modern image backbones. Existing state-of-the-art BEV …

Petrv2: A unified framework for 3d perception from multi-camera images

Y Liu, J Yan, F Jia, S Li, A Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose PETRv2, a unified framework for 3D perception from multi-view
images. Based on PETR, PETRv2 explores the effectiveness of temporal modeling, which …

Bevstereo: Enhancing depth estimation in multi-view 3d object detection with temporal stereo

Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Restricted by the ability of depth perception, all Multi-view 3D object detection methods fall
into the bottleneck of depth accuracy. By constructing temporal stereo, depth estimation is …

Vision-centric bev perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Visual point cloud forecasting enables scalable autonomous driving

Z Yang, L Chen, Y Sun, H Li - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
In contrast to extensive studies on general vision pre-training for scalable visual
autonomous driving remains seldom explored. Visual autonomous driving applications …

Bevdistill: Cross-modal bev distillation for multi-view 3d object detection

Z Chen, Z Li, S Zhang, L Fang, Q Jiang… - arXiv preprint arXiv …, 2022 - arxiv.org
3D object detection from multiple image views is a fundamental and challenging task for
visual scene understanding. Owing to its low cost and high efficiency, multi-view 3D object …

QE-BEV: Query evolution for bird's eye view object detection in varied contexts

J Yao, Y Lai, H Kou, T Wu, R Liu - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
3D object detection plays a pivotal role in autonomous driving and robotics, demanding
precise interpretation of Bird's Eye View (BEV) images. The dynamic nature of real-world …