State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Nersemble: Multi-view radiance field reconstruction of human heads

T Kirschstein, S Qian, S Giebenhain, T Walter… - ACM Transactions on …, 2023 - dl.acm.org
We focus on reconstructing high-fidelity radiance fields of human heads, capturing their
animations over time, and synthesizing re-renderings from novel viewpoints at arbitrary time …

Mononphm: Dynamic head reconstruction from monocular videos

S Giebenhain, T Kirschstein… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract We present Monocular Neural Parametric Head Models (MonoNPHM) for dynamic
3D head reconstructions from monocular RGB videos. To this end we propose a latent …

Diffusionavatars: Deferred diffusion for high-fidelity 3d head avatars

T Kirschstein, S Giebenhain… - Proceedings of the …, 2024 - openaccess.thecvf.com
DiffusionAvatars synthesizes a high-fidelity 3D head avatar of a person offering intuitive
control over both pose and expression. We propose a diffusion-based neural renderer that …

Neural haircut: Prior-guided strand-based hair reconstruction

V Sklyarova, J Chelishev, A Dogaru… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generating realistic human 3D reconstructions using image or video data is essential for
various communication and entertainment applications. While existing methods achieved …

Media2face: Co-speech facial animation generation with multi-modality guidance

Q Zhao, P Long, Q Zhang, D Qin, H Liang… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
The synthesis of 3D facial animations from speech has garnered considerable attention.
Due to the scarcity of high-quality 4D facial data and well-annotated abundant multi-modality …

Facetalk: Audio-driven motion diffusion for neural parametric head models

S Aneja, J Thies, A Dai… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We introduce FaceTalk a novel generative approach designed for synthesizing high-fidelity
3D motion sequences of talking human heads from input audio signal. To capture the …

What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs

A Trevithick, M Chan, T Takikawa… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract 3D-aware Generative Adversarial Networks (GANs) have shown remarkable
progress in learning to generate multi-view-consistent images and 3D geometries of scenes …

Npga: Neural parametric gaussian avatars

S Giebenhain, T Kirschstein, M Rünz… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
The creation of high-fidelity, digital versions of human heads is an important stepping stone
in the process of further integrating virtual components into our everyday lives. Constructing …

Advances in 3d generation: A survey

X Li, Q Zhang, D Kang, W Cheng, Y Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
Generating 3D models lies at the core of computer graphics and has been the focus of
decades of research. With the emergence of advanced neural representations and …