C Min, D Zhao, L Xiao, J Zhao, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-centric autonomous driving has recently attracted wide attention due to its lower cost. Pre-training is essential for extracting a universal representation. However, current vision …
The rise of large foundation models, trained on extensive datasets, is revolutionizing the field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by …
Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC), presents a significant challenge in computer vision. Previous methods, confined to onboard …
Popular representation learning methods encourage feature invariance under transformations applied at the input. However, in 3D perception tasks like object localization …