SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving

Z Yang, L Chen, Y Sun, H Li - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

In contrast to extensive studies on general vision pre-training for scalable visual
autonomous driving remains seldom explored. Visual autonomous driving applications …

被引用次数：8 相关文章所有 2 个版本

[PDF] thecvf.com

Driveworld: 4d pre-trained scene understanding via world models for autonomous driving

C Min, D Zhao, L Xiao, J Zhao, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Vision-centric autonomous driving has recently raised wide attention due to its lower cost.
Pre-training is essential for extracting a universal representation. However current vision …

被引用次数：2 相关文章所有 4 个版本

[PDF] arxiv.org

Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

X Yan, H Zhang, Y Cai, J Guo, W Qiu, B Gao… - arXiv preprint arXiv …, 2024 - arxiv.org

The rise of large foundation models, trained on extensive datasets, is revolutionizing the
field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by …

被引用次数：7 相关文章所有 2 个版本

[PDF] arxiv.org

Occfiner: Offboard occupancy refinement with hybrid propagation

H Shi, S Wang, J Zhang, X Yin, Z Wang, Z Zhao… - arXiv preprint arXiv …, 2024 - arxiv.org

Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC),
presents a significant challenge in computer vision. Previous methods, confined to onboard …

被引用次数：1 相关文章所有 4 个版本

[PDF] arxiv.org

Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

D Hegde, S Lohit, KC Peng, MJ Jones… - arXiv preprint arXiv …, 2024 - arxiv.org

Popular representation learning methods encourage feature invariance under
transformations applied at the input. However, in 3D perception tasks like object localization …

高级搜索

QQ 群