The creation and detection of deepfakes: A survey

Y Mirsky, W Lee - ACM computing surveys (CSUR), 2021 - dl.acm.org
Generative deep learning algorithms have progressed to a point where it is difficult to tell the
difference between what is real and what is fake. In 2018, it was discovered how easy it is to …

Tensor methods in computer vision and deep learning

Y Panagakis, J Kossaifi, GG Chrysos… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Tensors, or multidimensional arrays, are data structures that can naturally represent visual
data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic …

A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation

L Liu, L Gao, W Lei, F Ma, X Lin, J Wang - arXiv preprint arXiv:2308.08849, 2023 - arxiv.org
Body language (BL) refers to the non-verbal communication expressed through physical
movements, gestures, facial expressions, and postures. It is a form of communication that …

SRG3: Speech-driven Robot Gesture Generation with GAN

C Yu, A Tapus - 2020 16th International Conference on Control …, 2020 - ieeexplore.ieee.org
The human gestures occur spontaneously and usually they are aligned with speech, which
leads to a natural and expressive interaction. Speech-driven gesture generation is important …

Audio driven artificial video face synthesis using gan and machine learning approaches

A Kumar Das, R Naskar - International Conference on Computational …, 2022 - Springer
Now-a-days a large number of people share their opinion in either audio or video format
through internet. Some of them are real videos and some are fake. So, we need to find out …

Robot behavior generation and human behavior understanding in natural human-robot interaction

C Yu - 2021 - theses.hal.science
Having a natural interaction makes a significant difference in a successful human-robot
interaction (HRI). The natural HRI refers to both human multimodal behavior understanding …

Real-Time Speech-Driven Avatar Animation by Predicting Facial landmarks and Deformation Blendshapes

JC Vásquez-Correa, S Moreno-Acevedo… - Proceedings of the …, 2024 - aclanthology.org
The evolution of virtual spaces and live events demands sophisticated methods for avatar
animation. While existing techniques offer diverse approaches, limitations persist in …

Uncertainty-Based Multi-modal Learning for Myocardial Infarction Diagnosis Using Echocardiography and Electrocardiograms

Y Yang, M Rocher, P Moceri, M Sermesant - International Workshop on …, 2024 - Springer
Medical devices used in cardiac diagnostics typically capture only one aspect of heart
function. For instance, 2D B-mode echocardiography reveals the heart's anatomy and …