Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering

B Fei, Y Li, W Yang, L Ma, Y He - arXiv preprint arXiv:2404.13619, 2024 - arxiv.org
State-of-the-art 3D models, which excel in recognition tasks, typically depend on large-scale
datasets and well-defined category sets. Recent advances in multi-modal pre-training have …
