Distillbev: Boosting multi-camera 3d object detection with cross-modal knowledge distillation

C Min, D Zhao, L Xiao, J Zhao, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Vision-centric autonomous driving has recently raised wide attention due to its lower cost.
Pre-training is essential for extracting a universal representation. However current vision …

被引用次数：2 相关文章所有 4 个版本

[PDF] thecvf.com

RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR Features

G Bang, K Choi, J Kim, D Kum… - Proceedings of the …, 2024 - openaccess.thecvf.com

The inherent noisy and sparse characteristics of radar data pose challenges in finding
effective representations for 3D object detection. In this paper we propose RadarDistill a …

被引用次数：2 相关文章所有 3 个版本

[PDF] thecvf.com

OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

S Sirko-Galouchenko, A Boulch… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce a self-supervised pretraining method called OccFeat for camera-only Bird's-
Eye-View (BEV) segmentation networks. With OccFeat we pretrain a BEV network via …

SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects

A Kumar, Y Guo, X Huang, L Ren… - Proceedings of the …, 2024 - openaccess.thecvf.com

Monocular 3D detectors achieve remarkable performance on cars and smaller objects.
However their performance drops on larger objects leading to fatal accidents. Some attribute …

CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation

L Zhao, J Song, KA Skinner - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

In the field of 3D object detection for autonomous driving LiDAR-Camera (LC) fusion is the
top-performing sensor configuration. Still LiDAR is relatively high cost which hinders …

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training

Y Gao, Z Wang, WS Zheng, C Xie… - Proceedings of the …, 2024 - openaccess.thecvf.com

Contrastive learning has emerged as a promising paradigm for 3D open-world
understanding ie aligning point cloud representation to image and text embedding space …

[HTML] arxiv.org

Bev-io: Enhancing bird's-eye-view 3d detection with instance occupancy

Z Zhang, Y Zhang, L Wang, Y Wang, H Lu - arXiv preprint arXiv …, 2023 - arxiv.org

A popular approach for constructing bird's-eye-view (BEV) representation in 3D detection is
to lift 2D image features onto the viewing frustum space based on explicitly predicted depth …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

高级搜索

QQ 群