Sapiens: Foundation for human vision models

R Khirodkar, T Bagautdinov, J Martinez… - … on Computer Vision, 2024 - Springer
We present Sapiens, a family of models for four fundamental human-centric vision tasks: 2D
pose estimation, body-part segmentation, depth estimation, and surface normal prediction …

One-shot high-fidelity talking-head synthesis with deformable neural radiance field

W Li, L Zhang, D Wang, B Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Talking head generation aims to generate faces that maintain the identity information of the
source image and imitate the motion of the driving image. Most pioneering methods rely …

Real-time radiance fields for single-image portrait view synthesis

A Trevithick, M Chan, M Stengel, E Chan… - ACM Transactions on …, 2023 - dl.acm.org
We present a one-shot method to infer and render a photorealistic 3D representation from a
single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image …

MetaPortrait: Identity-preserving talking head generation with fast personalized adaptation

B Zhang, C Qi, P Zhang, B Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we propose an ID-preserving talking head generation framework, which
advances previous methods in two aspects. First, as opposed to interpolating from sparse …

X-Portrait: Expressive portrait animation with hierarchical motion attention

Y Xie, H Xu, G Song, C Wang, Y Shi, L Luo - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
We propose X-Portrait, an innovative conditional diffusion model tailored for generating
expressive and temporally coherent portrait animation. Specifically, given a single portrait as …

Follow-Your-Emoji: Fine-controllable and expressive freestyle portrait animation

Y Ma, H Liu, H Wang, H Pan, Y He, J Yuan… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We present Follow-Your-Emoji, a diffusion-based framework for portrait animation, which
animates a reference portrait with target landmark sequences. The main challenge of portrait …

LatentAvatar: Learning latent expression code for expressive neural head avatar

Y Xu, H Zhang, L Wang, X Zhao, H Huang… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Existing approaches to animatable NeRF-based head avatars are either built upon face
templates or use the expression coefficients of templates as the driving signal. Despite the …

LivePortrait: Efficient portrait animation with stitching and retargeting control

J Guo, D Zhang, X Liu, Z Zhong, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Portrait Animation aims to synthesize a lifelike video from a single source image, using it as
an appearance reference, with motion (i.e., facial expressions and head pose) derived from a …

Unsupervised volumetric animation

A Siarohin, W Menapace… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a novel approach for unsupervised 3D animation of non-rigid deformable
objects. Our method learns the 3D structure and dynamics of objects solely from single-view …

AvatarMAV: Fast 3D head avatar reconstruction using motion-aware neural voxels

Y Xu, L Wang, X Zhao, H Zhang, Y Liu - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
With NeRF widely used for facial reenactment, recent methods can recover photo-realistic
3D head avatar from just a monocular video. Unfortunately, the training process of the NeRF …