Talking head from speech audio using a pre-trained image generator

J Xing, M Xia, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com

Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

被引用次数：147 相关文章所有 8 个版本

[PDF] arxiv.org

Dae-talker: High fidelity speech-driven talking face generation with diffusion autoencoder

C Du, Q Chen, T He, X Tan, X Chen, K Yu… - Proceedings of the 31st …, 2023 - dl.acm.org

While recent research has made significant progress in speech-driven talking face
generation, the quality of the generated video still lags behind that of real recordings. One …

被引用次数：41 相关文章所有 4 个版本

[PDF] thecvf.com

Talking head generation with probabilistic audio-to-visual diffusion priors

Z Yu, Z Yin, D Zhou, D Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce a novel framework for one-shot audio-driven talking head generation. Unlike
prior works that require additional driving sources for controlled synthesis in a deterministic …

被引用次数：33 相关文章所有 5 个版本

[PDF] mdpi.com

Application of a 3D Talking Head as Part of Telecommunication AR, VR, MR System: Systematic Review

N Christoff, NN Neshov, K Tonchev, A Manolova - Electronics, 2023 - mdpi.com

In today's digital era, the realms of virtual reality (VR), augmented reality (AR), and mixed
reality (MR) collectively referred to as extended reality (XR) are reshaping human–computer …

被引用次数：8 相关文章所有 3 个版本

[PDF] arxiv.org

Selftalk: A self-supervised commutative training diagram to comprehend 3d talking faces

Z Peng, Y Luo, Y Shi, H Xu, X Zhu, H Liu, J He… - Proceedings of the 31st …, 2023 - dl.acm.org

Speech-driven 3D face animation technique, extending its applications to various
multimedia fields. Previous research has generated promising realistic lip movements and …

被引用次数：35 相关文章所有 3 个版本

[PDF] springer.com

Controllable image synthesis methods, applications and challenges: a comprehensive survey

S Huang, Q Li, J Liao, S Wang, L Liu, L Li - Artificial Intelligence Review, 2024 - Springer

Abstract Controllable Image Synthesis (CIS) is a methodology that allows users to generate
desired images or manipulate specific attributes of images by providing precise input …

[PDF] arxiv.org

Scantalk: 3d talking heads from unregistered scans

F Nocentini, T Besnier, C Ferrari, S Arguillere… - … on Computer Vision, 2024 - Springer

Speech-driven 3D talking heads generation has emerged as a significant area of interest
among researchers, presenting numerous challenges. Existing methods are constrained by …

被引用次数：4 相关文章所有 2 个版本

[PDF] aaai.org

Style2talker: High-resolution talking head generation with emotion style and art style

S Tan, B Ji, Y Pan - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Although automatically animating audio-driven talking heads has recently received growing
interest, previous efforts have mainly concentrated on achieving lip synchronization with the …

被引用次数：13 相关文章所有 3 个版本

[PDF] aaai.org

Say anything with any style

S Tan, B Ji, Y Ding, Y Pan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Generating stylized talking head with diverse head motions is crucial for achieving natural-
looking videos but still remains challenging. Previous works either adopt a regressive …

被引用次数：11 相关文章所有 3 个版本

[PDF] arxiv.org

KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding

Z Xu, S Gong, J Tang, L Liang, Y Huang, H Li… - … on Computer Vision, 2024 - Springer

We present a novel approach for synthesizing 3D facial motions from audio sequences
using key motion embeddings. Despite recent advancements in data-driven techniques …

被引用次数：2 相关文章所有 6 个版本

高级搜索

QQ 群