Deep speech: Scaling up end-to-end speech recognition

Y Lu, J Chai, X Cao - ACM Transactions on Graphics (ToG), 2021 - dl.acm.org

To the best of our knowledge, we first present a live system that generates personalized
photorealistic talking-head animation only driven by audio signals at over 30 fps. Our system …

被引用次数：143 相关文章所有 4 个版本

[PDF] cbml.science

Deep learning: new computational modelling techniques for genomics

G Eraslan, Ž Avsec, J Gagneur, FJ Theis - Nature Reviews Genetics, 2019 - nature.com

As a data-driven science, genomics largely utilizes machine learning to capture
dependencies in data and derive novel biological hypotheses. However, the ability to extract …

被引用次数：984 相关文章所有 6 个版本

[PDF] arxiv.org

Torchaudio: Building blocks for audio and speech processing

YY Yang, M Hira, Z Ni, A Astafurov… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

This document describes version 0.10 of TorchAudio: building blocks for machine learning
applications in the audio and speech processing domain. The objective of TorchAudio is to …

被引用次数：170 相关文章所有 7 个版本

[PDF] thecvf.com

Difftalk: Crafting diffusion models for generalized audio-driven portraits animation

S Shen, W Zhao, Z Meng, W Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

Talking head synthesis is a promising approach for the video production industry. Recently,
a lot of effort has been devoted in this research area to improve the generation quality or …

被引用次数：51 相关文章所有 6 个版本

[PDF] sciencedirect.com

Surrogate modeling for fluid flows based on physics-constrained deep learning without simulation data

L Sun, H Gao, S Pan, JX Wang - Computer Methods in Applied Mechanics …, 2020 - Elsevier

Numerical simulations on fluid dynamics problems primarily rely on spatially or/and
temporally discretization of the governing equation using polynomials into a finite …

被引用次数：732 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] Adversarial attacks and defenses in deep learning

K Ren, T Zheng, Z Qin, X Liu - Engineering, 2020 - Elsevier

With the rapid developments of artificial intelligence (AI) and deep learning (DL) techniques,
it is critical to ensure the security and robustness of the deployed algorithms. Recently, the …

被引用次数：569 相关文章所有 2 个版本

[PDF] thecvf.com

Facial: Synthesizing dynamic talking face with implicit attribute learning

C Zhang, Y Zhao, Y Huang, M Zeng… - Proceedings of the …, 2021 - openaccess.thecvf.com

In this paper, we propose a talking face generation method that takes an audio signal as
input and a short target video clip as reference, and synthesizes a photo-realistic video of …

被引用次数：133 相关文章所有 7 个版本

[HTML] mdpi.com

[HTML][HTML] Review of artificial intelligence and machine learning technologies: classification, restrictions, opportunities and challenges

RI Mukhamediev, Y Popova, Y Kuchin, E Zaitseva… - Mathematics, 2022 - mdpi.com

Artificial intelligence (AI) is an evolving set of technologies used for solving a wide range of
applied issues. The core of AI is machine learning (ML)—a complex of algorithms and …

被引用次数：111 相关文章所有 7 个版本

[PDF] arxiv.org

Neural architecture search: Insights from 1000 papers

C White, M Safari, R Sukthanker, B Ru, T Elsken… - arXiv preprint arXiv …, 2023 - arxiv.org

In the past decade, advances in deep learning have resulted in breakthroughs in a variety of
areas, including computer vision, natural language understanding, speech recognition, and …

被引用次数：71 相关文章所有 2 个版本

[PDF] arxiv.org

Adversarial attacks on deep-learning models in natural language processing: A survey

WE Zhang, QZ Sheng, A Alhazmi, C Li - ACM Transactions on Intelligent …, 2020 - dl.acm.org

With the development of high computational devices, deep neural networks (DNNs), in
recent years, have gained significant popularity in many Artificial Intelligence (AI) …

被引用次数：646 相关文章所有 6 个版本

高级搜索

QQ 群