Codetalker: Speech-driven 3d facial animation with discrete motion prior

J Xing, M Xia, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

Dae-talker: High fidelity speech-driven talking face generation with diffusion autoencoder

C Du, Q Chen, T He, X Tan, X Chen, K Yu… - Proceedings of the 31st …, 2023 - dl.acm.org
While recent research has made significant progress in speech-driven talking face
generation, the quality of the generated video still lags behind that of real recordings. One …

Talking head generation with probabilistic audio-to-visual diffusion priors

Z Yu, Z Yin, D Zhou, D Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce a novel framework for one-shot audio-driven talking head generation. Unlike
prior works that require additional driving sources for controlled synthesis in a deterministic …

Application of a 3D Talking Head as Part of Telecommunication AR, VR, MR System: Systematic Review

N Christoff, NN Neshov, K Tonchev, A Manolova - Electronics, 2023 - mdpi.com
In today's digital era, the realms of virtual reality (VR), augmented reality (AR), and mixed
reality (MR) collectively referred to as extended reality (XR) are reshaping human–computer …

Selftalk: A self-supervised commutative training diagram to comprehend 3d talking faces

Z Peng, Y Luo, Y Shi, H Xu, X Zhu, H Liu, J He… - Proceedings of the 31st …, 2023 - dl.acm.org
Speech-driven 3D face animation technique, extending its applications to various
multimedia fields. Previous research has generated promising realistic lip movements and …

Controllable image synthesis methods, applications and challenges: a comprehensive survey

S Huang, Q Li, J Liao, S Wang, L Liu, L Li - Artificial Intelligence Review, 2024 - Springer
Abstract Controllable Image Synthesis (CIS) is a methodology that allows users to generate
desired images or manipulate specific attributes of images by providing precise input …

Scantalk: 3d talking heads from unregistered scans

F Nocentini, T Besnier, C Ferrari, S Arguillere… - … on Computer Vision, 2024 - Springer
Speech-driven 3D talking heads generation has emerged as a significant area of interest
among researchers, presenting numerous challenges. Existing methods are constrained by …

Style2talker: High-resolution talking head generation with emotion style and art style

S Tan, B Ji, Y Pan - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Although automatically animating audio-driven talking heads has recently received growing
interest, previous efforts have mainly concentrated on achieving lip synchronization with the …

Say anything with any style

S Tan, B Ji, Y Ding, Y Pan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Generating stylized talking head with diverse head motions is crucial for achieving natural-
looking videos but still remains challenging. Previous works either adopt a regressive …

KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding

Z Xu, S Gong, J Tang, L Liang, Y Huang, H Li… - … on Computer Vision, 2024 - Springer
We present a novel approach for synthesizing 3D facial motions from audio sequences
using key motion embeddings. Despite recent advancements in data-driven techniques …