Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Grid-centric traffic scenario perception for autonomous driving: A comprehensive review

Y Shi, K Jiang, J Li, Z Qian, J Wen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
The grid-centric perception is a crucial field for mobile robot perception and navigation.
Nonetheless, the grid-centric perception is less prevalent than object-centric perception as …

Drivegpt4: Interpretable end-to-end autonomous driving via large language model

Z Xu, Y Zhang, E Xie, Z Zhao, Y Guo… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
Multimodallarge language models (MLLMs) have emerged as a prominent area of interest
within the research community, given their proficiency in handling and reasoning with non …

Bevformer: Learning bird's-eye-view representation from multi-camera images via spatiotemporal transformers

Z Li, W Wang, H Li, E Xie, C Sima, T Lu, Y Qiao… - European conference on …, 2022 - Springer
Abstract 3D visual perception tasks, including 3D detection and map segmentation based on
multi-camera images, are essential for autonomous driving systems. In this work, we present …

Transfuser: Imitation with transformer-based sensor fusion for autonomous driving

K Chitta, A Prakash, B Jaeger, Z Yu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
How should we integrate representations from complementary sensors for autonomous
driving? Geometry-based fusion has shown promise for perception (eg, object detection …

Polarformer: Multi-camera 3d object detection with polar transformer

Y Jiang, L Zhang, Z Miao, X Zhu, J Gao, W Hu… - Proceedings of the …, 2023 - ojs.aaai.org
Abstract 3D object detection in autonomous driving aims to reason “what” and “where” the
objects of interest present in a 3D world. Following the conventional wisdom of previous 2D …

Vision-centric bev perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Cobevt: Cooperative bird's eye view semantic segmentation with sparse transformers

R Xu, Z Tu, H Xiang, W Shao, B Zhou, J Ma - arXiv preprint arXiv …, 2022 - arxiv.org
Bird's eye view (BEV) semantic segmentation plays a crucial role in spatial sensing for
autonomous driving. Although recent literature has made significant progress on BEV map …

Persformer: 3d lane detection via perspective transformer and the openlane benchmark

L Chen, C Sima, Y Li, Z Zheng, J Xu, X Geng… - … on Computer Vision, 2022 - Springer
Methods for 3D lane detection have been recently proposed to address the issue of
inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.) …

Bevsegformer: Bird's eye view semantic segmentation from arbitrary camera rigs

L Peng, Z Chen, Z Fu, P Liang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Semantic segmentation in bird's eye view (BEV) is an important task for autonomous driving.
Though this task has attracted a large amount of research efforts, it is still challenging to …