State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Evaluating text-to-visual generation with image-to-text generation

Z Lin, D Pathak, B Li, J Li, X Xia, G Neubig… - … on Computer Vision, 2025 - Springer
Despite significant progress in generative AI, comprehensive evaluation remains
challenging because of the lack of effective metrics and standardized benchmarks. For …

A comprehensive survey on 3D content generation

J Liu, X Huang, T Huang, L Chen, Y Hou… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent years have witnessed remarkable advances in artificial intelligence generated
content (AIGC), with diverse input modalities, e.g., text, image, video, audio and 3D. The 3D is …

LucidDreamer: Towards high-fidelity text-to-3D generation via interval score matching

Y Liang, X Yang, J Lin, H Li, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The recent advancements in text-to-3D generation mark a significant milestone in generative
models, unlocking new possibilities for creating imaginative 3D assets across various real …

ShapeLLM: Universal 3D object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - … on Computer Vision, 2025 - Springer
This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

ComboVerse: Compositional 3D assets creation using spatially-aware diffusion guidance

Y Chen, T Wang, T Wu, X Pan, K Jia, Z Liu - European Conference on …, 2025 - Springer
Generating high-quality 3D assets from a given image is highly desirable in various
applications such as AR/VR. Recent advances in single-image 3D generation explore feed …

Generative AI meets 3D: A survey on text-to-3D in AIGC era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

TC4D: Trajectory-conditioned text-to-4D generation

S Bahmani, X Liu, W Yifan, I Skorokhodov… - … on Computer Vision, 2025 - Springer
Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …

DreamReward: Text-to-3D generation with human preference

J Ye, F Liu, Q Li, Z Wang, Y Wang, X Wang… - … on Computer Vision, 2025 - Springer
3D content creation from text prompts has shown remarkable success recently.
However, current text-to-3D methods often generate 3D results that do not align well with …

Taming mode collapse in score distillation for text-to-3D generation

P Wang, D Xu, Z Fan, D Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Despite the remarkable performance of score distillation in text-to-3D generation, such
techniques notoriously suffer from view inconsistency issues, also known as "Janus" artifact …