Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective...

W Chen, Y Li, Z Tian, F Zhang - Array, 2023 - Elsevier

Object detection is a crucial branch of computer vision that aims to locate and classify
objects in images. Using deep convolutional neural networks (CNNs) as the primary …

被引用次数：34 相关文章

[PDF] arxiv.org

Robustness-aware 3d object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

被引用次数：15 相关文章所有 2 个版本

[PDF] thecvf.com

Exploring object-centric temporal modeling for efficient multi-view 3d object detection

S Wang, Y Liu, T Wang, Y Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

In this paper, we propose a long-sequence modeling framework, named StreamPETR, for
multi-view 3D object detection. Built upon the sparse query design in the PETR series, we …

被引用次数：118 相关文章所有 5 个版本

[PDF] thecvf.com

Sparsebev: High-performance sparse 3d object detection from multi-camera videos

H Liu, Y Teng, T Lu, H Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Camera-based 3D object detection in BEV (Bird's Eye View) space has drawn great
attention over the past few years. Dense detectors typically follow a two-stage pipeline by …

被引用次数：59 相关文章所有 5 个版本

[PDF] thecvf.com

Fb-bev: Bev representation from forward-backward view transformations

Z Li, Z Yu, W Wang, A Anandkumar… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract View Transformation Module (VTM), where transformations happen between multi-
view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera …

被引用次数：48 相关文章所有 7 个版本

[PDF] thecvf.com

Unitr: A unified and efficient multi-modal transformer for bird's-eye-view representation

H Wang, H Tang, S Shi, A Li, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

Jointly processing information from multiple sensors is crucial to achieving accurate and
robust perception for reliable autonomous driving systems. However, current 3D perception …

被引用次数：35 相关文章所有 6 个版本

[PDF] thecvf.com

Unipad: A universal pre-training paradigm for autonomous driving

H Yang, S Zhang, D Huang, X Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com

In the context of autonomous driving the significance of effective feature learning is widely
acknowledged. While conventional 3D self-supervised pre-training methods have shown …

被引用次数：18 相关文章所有 4 个版本

[PDF] arxiv.org

Vision-centric bev perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou, Y Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

被引用次数：104 相关文章所有 2 个版本

[PDF] thecvf.com

Efficient deformable convnets: Rethinking dynamic and sparse operator for vision applications

Y Xiong, Z Li, Y Chen, F Wang, X Zhu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract We introduce Deformable Convolution v4 (DCNv4) a highly efficient and effective
operator designed for a broad spectrum of vision applications. DCNv4 addresses the …

被引用次数：23 相关文章所有 4 个版本

[PDF] thecvf.com

Is ego status all you need for open-loop end-to-end autonomous driving?

Z Li, Z Yu, S Lan, J Li, J Kautz, T Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com

End-to-end autonomous driving recently emerged as a promising research direction to
target autonomy from a full-stack perspective. Along this line many of the latest works follow …

被引用次数：18 相关文章所有 3 个版本

高级搜索

QQ 群