Scene as occupancy

W Tong, C Sima, T Wang, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human driver can easily describe the complex traffic scene by visual system. Such an ability
of precise perception is essential for driver's planning. To achieve this, a geometry-aware …

Visual point cloud forecasting enables scalable autonomous driving

Z Yang, L Chen, Y Sun, H Li - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
In contrast to extensive studies on general vision pre-training for scalable visual
autonomous driving remains seldom explored. Visual autonomous driving applications …

Leveraging vision-centric multi-modal expertise for 3d object detection

L Huang, Z Li, C Sima, W Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc
Current research is primarily dedicated to advancing the accuracy of camera-only 3D object
detectors (apprentice) through the knowledge transferred from LiDAR-or multi-modal-based …

Tig-bev: Multi-view bev 3d object detection via target inner-geometry learning

P Huang, L Liu, R Zhang, S Zhang, X Xu… - arXiv preprint arXiv …, 2022 - arxiv.org
To achieve accurate and low-cost 3D object detection, existing methods propose to benefit
camera-based multi-view detectors with spatial cues provided by the LiDAR modality, eg …

Not all voxels are equal: Hardness-aware semantic scene completion with self-distillation

S Wang, J Yu, W Li, W Liu, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Semantic scene completion also known as semantic occupancy prediction can provide
dense geometric and semantic information for autonomous vehicles which attracts the …

CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation

L Zhao, J Song, KA Skinner - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
In the field of 3D object detection for autonomous driving LiDAR-Camera (LC) fusion is the
top-performing sensor configuration. Still LiDAR is relatively high cost which hinders …

Adaptive Learning against Muscle Fatigue for A-mode Ultrasound based Gesture Recognition

J Zeng, Y Sheng, Z Zhou, Y Yang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
It is evident that the state-of-the-art in multisensory hand gesture recognition indicates the
superior performance of the A-mode ultrasound (AUS) modality over its counterparts …

InstKD: Towards Lightweight 3D Object Detection With Instance-Aware Knowledge Distillation

H Zhang, L Liu, Y Huang, X Lei… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep neural network (DNN) is extensively explored for LiDAR-based 3D object detection, a
crucial perception task in the field of autonomous driving. However, the presence of …

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

S Kim, Y Kim, S Hwang, H Jeong, D Kum - arXiv preprint arXiv:2407.10164, 2024 - arxiv.org
Recent advancements in camera-based 3D object detection have introduced cross-modal
knowledge distillation to bridge the performance gap with LiDAR 3D detectors, leveraging …

Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection

H Zheng, D Cao, J Xu, R Ai, W Gu, Y Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Striking a balance between precision and efficiency presents a prominent challenge in the
bird's-eye-view (BEV) 3D object detection. Although previous camera-based BEV methods …