Live speech portraits: real-time photorealistic talking-head animation

Y Lu, J Chai, X Cao - ACM Transactions on Graphics (ToG), 2021 - dl.acm.org
To the best of our knowledge, we first present a live system that generates personalized
photorealistic talking-head animation only driven by audio signals at over 30 fps. Our system …

Deep learning: new computational modelling techniques for genomics

G Eraslan, Ž Avsec, J Gagneur, FJ Theis - Nature Reviews Genetics, 2019 - nature.com
As a data-driven science, genomics largely utilizes machine learning to capture
dependencies in data and derive novel biological hypotheses. However, the ability to extract …

TorchAudio: Building blocks for audio and speech processing

YY Yang, M Hira, Z Ni, A Astafurov… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
This document describes version 0.10 of TorchAudio: building blocks for machine learning
applications in the audio and speech processing domain. The objective of TorchAudio is to …

DiffTalk: Crafting diffusion models for generalized audio-driven portraits animation

S Shen, W Zhao, Z Meng, W Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Talking head synthesis is a promising approach for the video production industry. Recently,
a lot of effort has been devoted in this research area to improve the generation quality or …

Surrogate modeling for fluid flows based on physics-constrained deep learning without simulation data

L Sun, H Gao, S Pan, JX Wang - Computer Methods in Applied Mechanics …, 2020 - Elsevier
Numerical simulations on fluid dynamics problems primarily rely on spatially or/and
temporally discretization of the governing equation using polynomials into a finite …

Adversarial attacks and defenses in deep learning

K Ren, T Zheng, Z Qin, X Liu - Engineering, 2020 - Elsevier
With the rapid developments of artificial intelligence (AI) and deep learning (DL) techniques,
it is critical to ensure the security and robustness of the deployed algorithms. Recently, the …

FACIAL: Synthesizing dynamic talking face with implicit attribute learning

C Zhang, Y Zhao, Y Huang, M Zeng… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we propose a talking face generation method that takes an audio signal as
input and a short target video clip as reference, and synthesizes a photo-realistic video of …

Review of artificial intelligence and machine learning technologies: classification, restrictions, opportunities and challenges

RI Mukhamediev, Y Popova, Y Kuchin, E Zaitseva… - Mathematics, 2022 - mdpi.com
Artificial intelligence (AI) is an evolving set of technologies used for solving a wide range of
applied issues. The core of AI is machine learning (ML)—a complex of algorithms and …

Neural architecture search: Insights from 1000 papers

C White, M Safari, R Sukthanker, B Ru, T Elsken… - arXiv preprint arXiv …, 2023 - arxiv.org
In the past decade, advances in deep learning have resulted in breakthroughs in a variety of
areas, including computer vision, natural language understanding, speech recognition, and …

Adversarial attacks on deep-learning models in natural language processing: A survey

WE Zhang, QZ Sheng, A Alhazmi, C Li - ACM Transactions on Intelligent …, 2020 - dl.acm.org
With the development of high computational devices, deep neural networks (DNNs), in
recent years, have gained significant popularity in many Artificial Intelligence (AI) …