GPT-4V(ision) is a human-aligned evaluator for text-to-3D generation

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library

The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Cited by 86 · Related articles · All 12 versions

[PDF] arxiv.org

Evaluating text-to-visual generation with image-to-text generation

Z Lin, D Pathak, B Li, J Li, X Xia, G Neubig… - … on Computer Vision, 2025 - Springer

Despite significant progress in generative AI, comprehensive evaluation remains
challenging because of the lack of effective metrics and standardized benchmarks. For …

Cited by 53 · Related articles · All 2 versions

[PDF] arxiv.org

A comprehensive survey on 3D content generation

J Liu, X Huang, T Huang, L Chen, Y Hou… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent years have witnessed remarkable advances in artificial intelligence generated
content (AIGC), with diverse input modalities, e.g., text, image, video, audio and 3D. The 3D is …

Cited by 20 · Related articles · All 2 versions

[PDF] thecvf.com

LucidDreamer: Towards high-fidelity text-to-3D generation via interval score matching

Y Liang, X Yang, J Lin, H Li, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com

The recent advancements in text-to-3D generation mark a significant milestone in generative
models, unlocking new possibilities for creating imaginative 3D assets across various real …

Cited by 102 · Related articles · All 4 versions

[PDF] arxiv.org

ShapeLLM: Universal 3D object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - … on Computer Vision, 2025 - Springer

This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

Cited by 32 · Related articles · All 2 versions

[PDF] arxiv.org

ComboVerse: Compositional 3D assets creation using spatially-aware diffusion guidance

Y Chen, T Wang, T Wu, X Pan, K Jia, Z Liu - European Conference on …, 2025 - Springer

Generating high-quality 3D assets from a given image is highly desirable in various
applications such as AR/VR. Recent advances in single-image 3D generation explore feed …

Cited by 20 · Related articles · All 2 versions

[PDF] arxiv.org

Generative AI meets 3D: A survey on text-to-3D in AIGC era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arXiv preprint arXiv …, 2023 - arxiv.org

Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

Cited by 67 · Related articles · All 4 versions

[PDF] arxiv.org

TC4D: Trajectory-conditioned text-to-4D generation

S Bahmani, X Liu, W Yifan, I Skorokhodov… - … on Computer Vision, 2025 - Springer

Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …

Cited by 19 · Related articles · All 3 versions

[PDF] arxiv.org

DreamReward: Text-to-3D generation with human preference

J Ye, F Liu, Q Li, Z Wang, Y Wang, X Wang… - … on Computer Vision, 2025 - Springer

3D content creation from text prompts has shown remarkable success recently.
However, current text-to-3D methods often generate 3D results that do not align well with …

Cited by 16 · Related articles · All 4 versions

[PDF] thecvf.com

Taming mode collapse in score distillation for text-to-3D generation

P Wang, D Xu, Z Fan, D Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Despite the remarkable performance of score distillation in text-to-3D generation, such
techniques notoriously suffer from view inconsistency issues, also known as "Janus" artifact …

Cited by 13 · Related articles · All 4 versions
