In this paper, we consider the problem of audio-visual synchronisation applied to videos 'in-the-wild' (i.e. of general classes beyond speech). As a new task, we identify and curate a test …
T Xie, L Liao, C Bi, B Tang, X Yin, J Yang… - Proceedings of the 29th …, 2021 - dl.acm.org
The task of few-shot visual dubbing focuses on synchronizing the lip movements with arbitrary speech input for any talking head video. Albeit moderate improvements in current …
U Muaz, W Jang, R Tripathi, S Mani… - Proceedings of the …, 2023 - openaccess.thecvf.com
Dubbed video generation aims to accurately synchronize mouth movements of a given facial video with driving audio while preserving identity and scene-specific visual dynamics, such …
Dubbing is a post-production process of re-recording actors' dialogues, which is extensively used in filmmaking and video production. It is usually performed manually by professional …
In this paper we present VDTTS, a Visually-Driven Text-to-Speech model. Motivated by dubbing, VDTTS takes advantage of video frames as an additional input alongside text, and …
D Bigioi, P Corcoran - Frontiers in Signal Processing, 2023 - frontiersin.org
The proliferation of multi-lingual content on today's streaming services has created a need for automated multi-lingual dubbing tools. In this article, current state-of-the-art approaches …
This work reviews the dataset-driven advancements that have occurred in the area of lip motion analysis, particularly visual lip-reading and visual lip motion authentication, in the …
N Singh, CW Wu, I Orife… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Audiovisual representation learning typically relies on the correspondence between sight and sound. However, there are often multiple audio tracks that can correspond with a visual …
Visual dubbing uses visual computing and deep learning to alter the lip and mouth articulations of the actor to sync with the dubbed speech. It has the potential to greatly …