Sv3d: Novel multi-view synthesis and 3d generation from a single image using latent video diffusion

V Voleti, CH Yao, M Boss, A Letts, D Pankratz… - … on Computer Vision, 2025 - Springer
Abstract We present Stable Video 3D (SV3D)—a latent video diffusion model for high-
resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent …

Sparp: Fast 3d object reconstruction and pose estimation from sparse views

C Xu, A Li, L Chen, Y Liu, R Shi, H Su, M Liu - European Conference on …, 2025 - Springer
Open-world 3D generation has recently attracted considerable attention. While many single-
image-to-3D methods have yielded visually appealing outcomes, they often lack sufficient …

Echoscene: Indoor scene generation via information echo over scene graph diffusion

G Zhai, EP Örnek, DZ Chen, R Liao, Y Di… - … on Computer Vision, 2025 - Springer
We present EchoScene, an interactive and controllable generative model that generates 3D
indoor scenes on scene graphs. EchoScene leverages a dual-branch diffusion model that …

MorpheuS: Neural Dynamic 360deg Surface Reconstruction from Monocular RGB-D Video

H Wang, J Wang, L Agapito - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Neural rendering has demonstrated remarkable success in dynamic scene reconstruction.
Thanks to the expressiveness of neural representations prior works can accurately capture …

Tailor3d: Customized 3d assets editing and generation with dual-side images

Z Qi, Y Yang, M Zhang, L Xing, X Wu, T Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advances in 3D AIGC have shown promise in directly creating 3D objects from text
and images, offering significant cost savings in animation and product design. However …

Magic-boost: Boost 3d generation with mutli-view conditioned diffusion

F Yang, J Zhang, Y Shi, B Chen, C Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Benefiting from the rapid development of 2D diffusion models, 3D content creation has made
significant progress recently. One promising solution involves the fine-tuning of pre-trained …

Drive-1-to-3: Enriching diffusion priors for novel view synthesis of real vehicles

C Lin, B Zhuang, S Sun, Z Jiang, J Cai… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent advent of large-scale 3D data, eg Objaverse, has led to impressive progress in
training pose-conditioned diffusion models for novel view synthesis. However, due to the …

Zero123-6d: Zero-shot novel view synthesis for rgb category-level 6d pose estimation

F Di Felice, A Remus, S Gasperini, B Busam… - arXiv preprint arXiv …, 2024 - arxiv.org
Estimating the pose of objects through vision is essential to make robotic platforms interact
with the environment. Yet, it presents many challenges, often related to the lack of flexibility …

Dimvis: Diffusion-based multi-view synthesis

G Di Giacomo, G Franzese, T Cerquitelli… - ICML 2024 Workshop …, 2024 - openreview.net
Multi-view observations offer a broader perception of the real world, compared to
observations acquired from a single viewpoint. While existing multi-view 2D diffusion models …

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

L Yang, Z Zhang, J Han, B Zeng, R Li, P Torr… - arXiv preprint arXiv …, 2024 - arxiv.org
Generating high-quality 3D assets from textual descriptions remains a pivotal challenge in
computer graphics and vision research. Due to the scarcity of 3D data, state-of-the-art …