Occgen: Generative multi-modal 3d occupancy prediction for autonomous driving

G Wang, Z Wang, P Tang, J Zheng, X Ren… - … on Computer Vision, 2025 - Springer
Existing 3D semantic occupancy prediction methods typically treat the task as a one-shot 3D
voxel-wise segmentation problem, focusing on a single-step mapping between the inputs …

Dg-pic: Domain generalized point-in-context learning for point cloud understanding

J Jiang, Q Zhou, Y Li, X Lu, M Wang, L Ma… - … on Computer Vision, 2025 - Springer
Recent point cloud understanding research suffers from performance drops on unseen data,
due to the distribution shifts across different domains. While recent studies use Domain …

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

H Lu, J Tang, X Xu, X Cao, Y Zhang, G Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The emergence of Multi-Camera 3D Object Detection (MC3D-Det), facilitated by bird's-eye
view (BEV) representation, signifies a notable progression in 3D object detection. Scaling …