An image is worth one word: Personalizing text-to-image generation using textual inversion

R Gal, Y Alaluf, Y Atzmon, O Patashnik… - arXiv preprint arXiv …, 2022 - arxiv.org
Text-to-image models offer unprecedented freedom to guide creation through natural
language. Yet, it is unclear how such freedom can be exercised to generate images of …

Rodin: A generative model for sculpting 3D digital avatars using diffusion

T Wang, B Zhang, T Zhang, S Gu… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper presents a 3D diffusion model that automatically generates 3D digital avatars
represented as neural radiance fields (NeRFs). A significant challenge for 3D diffusion is …

Encoder-based domain tuning for fast personalization of text-to-image models

R Gal, M Arar, Y Atzmon, AH Bermano… - ACM Transactions on …, 2023 - dl.acm.org
Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user provided concepts, embedding them into new scenes guided by natural …

Gaussian head avatar: Ultra high-fidelity head avatar via dynamic gaussians

Y Xu, B Chen, Z Li, H Zhang, L Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Creating high-fidelity 3D head avatars has always been a research hotspot but there
remains a great challenge under lightweight sparse view setups. In this paper we propose …

Instant volumetric head avatars

W Zielonka, T Bolkart, J Thies - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present Instant Volumetric Head Avatars (INSTA), a novel approach for
reconstructing photo-realistic digital avatars instantaneously. INSTA models a dynamic …

AvatarReX: Real-time expressive full-body avatars

Z Zheng, X Zhao, H Zhang, B Liu, Y Liu - ACM Transactions on Graphics …, 2023 - dl.acm.org
We present AvatarReX, a new method for learning NeRF-based full-body avatars from video
data. The learnt avatar not only provides expressive control of the body, hands and the face …

DreamFace: Progressive generation of animatable 3D faces under text guidance

L Zhang, Q Qiu, H Lin, Q Zhang, C Shi, W Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Emerging Metaverse applications demand accessible, accurate, and easy-to-use tools for
3D digital human creations in order to depict different cultures and societies as if in the …

Domain-agnostic tuning-encoder for fast personalization of text-to-image models

M Arar, R Gal, Y Atzmon, G Chechik… - SIGGRAPH Asia 2023 …, 2023 - dl.acm.org
Text-to-image (T2I) personalization allows users to guide the creative image generation
process by combining their own visual concepts in natural language prompts. Recently …

HAvatar: High-fidelity head avatar via facial model conditioned neural radiance field

X Zhao, L Wang, J Sun, H Zhang, J Suo… - ACM Transactions on …, 2023 - dl.acm.org
The problem of modeling an animatable 3D human head avatar under lightweight setups is
of significant importance but has not been well solved. Existing 3D representations either …

LatentAvatar: Learning latent expression code for expressive neural head avatar

Y Xu, H Zhang, L Wang, X Zhao, H Huang… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Existing approaches to animatable NeRF-based head avatars are either built upon face
templates or use the expression coefficients of templates as the driving signal. Despite the …