Real-world robot applications of foundation models: A review

K Kawaharazuka, T Matsushima… - Advanced …, 2024 - Taylor & Francis
Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …

A comprehensive survey on 3D content generation

J Liu, X Huang, T Huang, L Chen, Y Hou… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent years have witnessed remarkable advances in artificial intelligence generated
content (AIGC), with diverse input modalities, e.g., text, image, video, audio and 3D. The 3D is …

FoundationPose: Unified 6D pose estimation and tracking of novel objects

B Wen, W Yang, J Kautz… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present FoundationPose, a unified foundation model for 6D object pose estimation and
tracking, supporting both model-based and model-free setups. Our approach can be instantly …

Generative AI meets 3D: A survey on text-to-3D in AIGC era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

TextureDreamer: Image-guided texture synthesis through geometry-aware diffusion

YY Yeh, JB Huang, C Kim, L Xiao… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present TextureDreamer, a novel image-guided texture synthesis method to transfer
relightable textures from a small number of input images (3 to 5) to target 3D shapes across …

Paint-it: Text-to-texture synthesis via deep convolutional texture map optimization and physically-based rendering

K Youwang, TH Oh… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present Paint-it, a text-driven high-fidelity texture map synthesis method for 3D meshes
via neural re-parameterized texture optimization. Paint-it synthesizes texture maps from a …

Paint3D: Paint anything 3D with lighting-less texture diffusion models

X Zeng, X Chen, Z Qi, W Liu, Z Zhao… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper presents Paint3D, a novel coarse-to-fine generative framework that is capable of
producing high-resolution, lighting-less, and diverse 2K UV texture maps for untextured 3D …

Generative rendering: Controllable 4D-guided video generation with 2D diffusion models

S Cai, D Ceylan, M Gadelha… - Proceedings of the …, 2024 - openaccess.thecvf.com
Traditional 3D content creation tools empower users to bring their imagination to life by
giving them direct control over a scene's geometry, appearance, motion, and camera path …

AnyHome: Open-vocabulary generation of structured and textured 3D homes

R Fu, Z Wen, Z Liu, S Sridhar - European Conference on Computer Vision, 2025 - Springer
Inspired by cognitive theories, we introduce AnyHome, a framework that translates any text
into well-structured and textured indoor scenes at a house-scale. By prompting Large …

FlashTex: Fast relightable mesh texturing with LightControlNet

K Deng, T Omernick, A Weiss, D Ramanan… - … on Computer Vision, 2025 - Springer
Manually creating textures for 3D meshes is time-consuming, even for expert visual content
creators. We propose a fast approach for automatically texturing an input 3D mesh based on …