Is-fusion: Instance-scene collaborative fusion for multimodal 3d object detection

J Yin, J Shen, R Chen, W Li, R Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Bird's eye view (BEV) representation has emerged as a dominant solution for describing 3D
space in autonomous driving scenarios. However objects in the BEV representation typically …

Scatterformer: Efficient voxel transformer with scattered linear attention

C He, R Li, G Zhang, L Zhang - European Conference on Computer Vision, 2025 - Springer
Window-based transformers excel in large-scale point cloud understanding by capturing
context-aware representations with affordable attention computation in a more localized …

SEED: A Simple and Effective 3D DETR in Point Clouds

Z Liu, J Hou, X Ye, T Wang, J Wang, X Bai - European Conference on …, 2025 - Springer
Recently, detection transformers (DETRs) have gradually taken a dominant position in 2D
detection thanks to their elegant framework. However, DETR-based detectors for 3D point …

PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection

KC Huang, W Lyu, MH Yang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent temporal LiDAR-based 3D object detectors achieve promising performance based
on the two-stage proposal-based approach. They generate 3D box candidates from the first …

Spatial-temporal graph enhanced detr towards multi-frame 3d object detection

Y Zhang, Z Zhu, J Hou, D Wu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
The Detection Transformer (DETR) has revolutionized the design of CNN-based object
detection systems, showcasing impressive performance. However, its potential in the …

General Geometry-aware Weakly Supervised 3D Object Detection

G Zhang, J Fan, L Chen, Z Zhang, Z Lei… - European Conference on …, 2025 - Springer
Abstract 3D object detection is an indispensable component for scene understanding.
However, the annotation of large-scale 3D datasets requires significant human effort. To …

TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection

P Jacobson, Y Xie, M Ding, C Xu, M Tomizuka… - arXiv preprint arXiv …, 2024 - arxiv.org
Semi-supervised 3D object detection is a common strategy employed to circumvent the
challenge of manually labeling large-scale autonomous driving perception datasets. Pseudo …

Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences

R Yu, R Zhao, C Nie, H Wang, HC Yan… - arXiv preprint arXiv …, 2024 - arxiv.org
Accurate and robust LiDAR 3D object detection is essential for comprehensive scene
understanding in autonomous driving. Despite its importance, LiDAR detection performance …

GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs

M Liu, E Yurtsever, M Brede, J Meng, W Zimmer… - arXiv preprint arXiv …, 2024 - arxiv.org
Accurate and effective 3D object detection is critical for ensuring the driving safety of
autonomous vehicles. Recently, state-of-the-art two-stage 3D object detectors have …

Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation

R Yu, R Zhao, J Li, Q Zhao, S Zhu, HC Yan… - arXiv preprint arXiv …, 2024 - arxiv.org
The LiDAR-based 3D object detector that strikes a balance between accuracy and speed is
crucial for achieving real-time perception in autonomous driving and robotic navigation …