Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in the real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Nerf: Neural radiance field in 3d vision, a comprehensive review

K Gao, Y Gao, H He, D Lu, L Xu, J Li - arXiv preprint arXiv:2210.00379, 2022 - arxiv.org
Neural Radiance Field (NeRF), a new novel view synthesis with implicit scene
representation, has taken the field of Computer Vision by storm. As a novel view synthesis …

Rodin: A generative model for sculpting 3d digital avatars using diffusion

T Wang, B Zhang, T Zhang, S Gu… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper presents a 3D diffusion model that automatically generates 3D digital avatars
represented as neural radiance fields (NeRFs). A significant challenge for 3D diffusion is …

Magic123: One image to high-quality 3d object generation using both 2d and 3d diffusion priors

G Qian, J Mai, A Hamdi, J Ren, A Siarohin, B Li… - arXiv preprint arXiv …, 2023 - arxiv.org
We present Magic123, a two-stage coarse-to-fine approach for high-quality, textured 3D
meshes generation from a single unposed image in the wild using both 2D and 3D priors. In …

Tri-miprf: Tri-mip representation for efficient anti-aliasing neural radiance fields

W Hu, Y Wang, L Ma, B Yang, L Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the tremendous progress in neural radiance fields (NeRF), we still face a dilemma of
the trade-off between quality and efficiency, e.g., MipNeRF presents fine-detailed and anti …

Next3d: Generative neural texture rasterization for 3d-aware head avatars

J Sun, X Wang, L Wang, X Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
3D-aware generative adversarial networks (GANs) synthesize high-fidelity and
multi-view-consistent facial images using only collections of single-view 2D imagery …

Sine: Semantic-driven image-based nerf editing with prior-guided editing field

C Bao, Y Zhang, B Yang, T Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the great success in 2D editing using user-friendly tools, such as Photoshop,
semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either …

Gaussian shell maps for efficient 3d human generation

R Abdal, W Yifan, Z Shi, Y Xu, R Po… - Proceedings of the …, 2024 - openaccess.thecvf.com
Efficient generation of 3D digital humans is important in several industries including virtual
reality, social media, and cinematic production. 3D generative adversarial networks (GANs) …

High-fidelity 3d gan inversion by pseudo-multi-view optimization

J Xie, H Ouyang, J Piao, C Lei… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a high-fidelity 3D generative adversarial network (GAN) inversion framework
that can synthesize photo-realistic novel views while preserving specific details of the input …

Headsculpt: Crafting 3d head avatars with text

X Han, Y Cao, K Han, X Zhu, J Deng… - Advances in …, 2024 - proceedings.neurips.cc
Recently, text-guided 3D generative methods have made remarkable advancements in
producing high-quality textures and geometry, capitalizing on the proliferation of large vision …