Triplane meets gaussian splatting: Fast and generalizable single-view 3d reconstruction with transformers

ZX Zou, Z Yu, YC Guo, Y Li, D Liang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in 3D reconstruction from single images have been driven by the
evolution of generative models. Prominent among these are methods based on Score …

Generative ai meets 3d: A survey on text-to-3d in aigc era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

Triposr: Fast 3d object reconstruction from a single image

D Tochilkin, D Pankratz, Z Liu, Z Huang, A Letts… - arXiv preprint arXiv …, 2024 - arxiv.org
This technical report introduces TripoSR, a 3D reconstruction model leveraging transformer
architecture for fast feed-forward 3D generation, producing 3D mesh from a single image in …

Collaborative control for geometry-conditioned PBR image generation

S Vainer, M Boss, M Parger, K Kutsy… - … on Computer Vision, 2025 - Springer
Graphics pipelines require physically-based rendering (PBR) materials, yet current 3D
content generation approaches are built on RGB models. We propose to model the PBR …

An object is worth 64x64 pixels: Generating 3d object via image diffusion

X Yan, HH Lee, Z Wan, AX Chang - arXiv preprint arXiv:2408.03178, 2024 - arxiv.org
We introduce a new approach for generating realistic 3D models with UV maps through a
representation termed" Object Images." This approach encapsulates surface geometry …

Advances in text-guided 3D editing: a survey

L Lu, R Li, X Zhang, H Wei, G Du, B Wang - Artificial Intelligence Review, 2024 - Springer
Abstract In 3D Artificial Intelligence Generated Content (AIGC), compared with generating
3D assets from scratch, editing extant 3D assets satisfies user prompts, allowing the creation …

Dreammesh4d: Video-to-4d generation with sparse-controlled gaussian-mesh hybrid representation

Z Li, Y Chen, P Liu - arXiv preprint arXiv:2410.06756, 2024 - arxiv.org
Recent advancements in 2D/3D generative techniques have facilitated the generation of
dynamic 3D objects from monocular videos. Previous methods mainly rely on the implicit …

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models

S Chen, X Chen, A Pang, X Zeng, W Cheng… - arXiv preprint arXiv …, 2024 - arxiv.org
The polygon mesh representation of 3D data exhibits great flexibility, fast rendering speed,
and storage efficiency, which is widely preferred in various applications. However, given its …

Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark

H Xu, M Zhang, H Ju, Z Zheng, H Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Producing emotionally dynamic 3D facial avatars with text derived from spoken words
(Emo3D) has been a pivotal research topic in 3D avatar generation. While progress has …