Advancing 3D point cloud understanding through deep transfer learning: A comprehensive survey

SS Sohail, Y Himeur, H Kheddar, A Amira, F Fadli… - Information …, 2024 - Elsevier
The 3D point cloud (3DPC) has significantly evolved and benefited from the advance of
deep learning (DL). However, the latter faces various issues, including the lack of data or …

Instantmesh: Efficient 3d mesh generation from a single image with sparse-view large reconstruction models

J Xu, W Cheng, Y Gao, X Wang, S Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a
single image, featuring state-of-the-art generation quality and significant training scalability …

Sv4d: Dynamic 3d content generation with multi-frame and multi-view consistency

Y Xie, CH Yao, V Voleti, H Jiang, V Jampani - arXiv preprint arXiv …, 2024 - arxiv.org
We present Stable Video 4D (SV4D), a latent video diffusion model for multi-frame and multi-
view consistent dynamic 3D content generation. Unlike previous methods that rely on …

Reconx: Reconstruct any scene from sparse views with video diffusion model

F Liu, W Sun, H Wang, Y Wang, H Sun, J Ye… - arXiv preprint arXiv …, 2024 - arxiv.org
Advancements in 3D scene reconstruction have transformed 2D images from the real world
into 3D models, producing realistic 3D results from hundreds of input photos. Despite great …

Drivedreamer4d: World models are effective data machines for 4d driving scene representation

G Zhao, C Ni, X Wang, Z Zhu, X Zhang, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Closed-loop simulation is essential for advancing end-to-end autonomous driving systems.
Contemporary sensor simulation methods, such as NeRF and 3DGS, rely predominantly on …

Dimensionx: Create any 3d and 4d scenes from a single image with controllable video diffusion

W Sun, S Chen, F Liu, Z Chen, Y Duan, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce\textbf {DimensionX}, a framework designed to generate
photorealistic 3D and 4D scenes from just a single image with video diffusion. Our approach …

3dgs-enhancer: Enhancing unbounded 3d gaussian splatting with view-consistent 2d diffusion priors

X Liu, C Zhou, S Huang - arXiv preprint arXiv:2410.16266, 2024 - arxiv.org
Novel-view synthesis aims to generate novel views of a scene from multiple input images or
videos, and recent advancements like 3D Gaussian splatting (3DGS) have achieved notable …

Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models

H Yang, Y Chen, Y Pan, T Yao, Z Chen… - Proceedings of the …, 2024 - dl.acm.org
Despite having tremendous progress in image-to-3D generation, existing methods still
struggle to produce multi-view consistent images with high-resolution textures in detail …

Phidias: A generative model for creating 3d content from text, image, and 3d conditions with reference-augmented diffusion

Z Wang, T Wang, Z He, G Hancke, Z Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
In 3D modeling, designers often use an existing 3D model as a reference to create new
ones. This practice has inspired the development of Phidias, a novel generative model that …

Large point-to-gaussian model for image-to-3d generation

L Lu, H Gao, T Dai, Y Zha, Z Hou, J Wu… - Proceedings of the 32nd …, 2024 - dl.acm.org
Recently, image-to-3D approaches have significantly advanced the generation quality and
speed of 3D assets based on large reconstruction models, particularly 3D Gaussian …