Embracing single stride 3d object detector with sparse transformer

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Cited by 80 · Related articles · All 7 versions

[PDF] arxiv.org

3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer

Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Cited by 79 · Related articles · All 7 versions

[PDF] thecvf.com

Voxelnext: Fully sparse voxelnet for 3d object detection and tracking

Y Chen, J Liu, X Zhang, X Qi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers,
and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be …

Cited by 128 · Related articles · All 6 versions

[PDF] thecvf.com

Deepfusion: Lidar-camera deep fusion for multi-modal 3d object detection

Y Li, AW Yu, T Meng, B Caine… - Proceedings of the …, 2022 - openaccess.thecvf.com

Lidars and cameras are critical sensors that provide complementary information for 3D
detection in autonomous driving. While prevalent multi-modal methods simply decorate raw …

Cited by 271 · Related articles · All 7 versions

[PDF] thecvf.com

V2v4real: A real-world large-scale dataset for vehicle-to-vehicle cooperative perception

R Xu, X Xia, J Li, H Li, S Zhang, Z Tu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Modern perception systems of autonomous vehicles are known to be sensitive to occlusions
and lack the capability of long perceiving range. It has been one of the key bottlenecks that …

Cited by 88 · Related articles · All 7 versions

[PDF] thecvf.com

Spherical transformer for lidar-based 3d recognition

X Lai, Y Chen, F Lu, J Liu, J Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

LiDAR-based 3D point cloud recognition has benefited various applications. Without
specially considering the LiDAR point distribution, most current methods suffer from …

Cited by 79 · Related articles · All 6 versions

[PDF] thecvf.com

Point Transformer V3: Simpler Faster Stronger

X Wu, L Jiang, PS Wang, Z Liu, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper is not motivated to seek innovation within the attention mechanism. Instead it
focuses on overcoming the existing trade-offs between accuracy and efficiency within the …

Cited by 24 · Related articles · All 2 versions

[PDF] mdpi.com

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …

Cited by 276 · Related articles · All 13 versions

[PDF] arxiv.org

Swformer: Sparse window transformer for 3d object detection in point clouds

P Sun, M Tan, W Wang, C Liu, F Xia, Z Leng… - … on Computer Vision, 2022 - Springer

3D object detection in point clouds is a core component for modern robotics and
autonomous driving systems. A key challenge in 3D object detection comes from the …

Cited by 76 · Related articles · All 5 versions

[PDF] thecvf.com

Logonet: Towards accurate 3d object detection with local-to-global cross-modal fusion

X Li, T Ma, Y Hou, B Shi, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com

LiDAR-camera fusion methods have shown impressive performance in 3D object detection.
Recent advanced multi-modal methods mainly perform global fusion, where image features …

Cited by 59 · Related articles · All 6 versions
