Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

Add 2022: the first audio deep synthesis detection challenge

J Yi, R Fu, J Tao, S Nie, H Ma, C Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021.
However, the recent shared tasks have not covered many real-life and challenging …

An overview of recent work in media forensics: Methods and threats

K Bhagtani, AKS Yadav, ER Bartusiak, Z Xiang… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we review recent work in media forensics for digital images, video, audio
(specifically speech), and documents. For each data modality, we discuss synthesis and …

What to remember: Self-adaptive continual learning for audio deepfake detection

X Zhang, J Yi, C Wang, CY Zhang, S Zeng… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The rapid evolution of speech synthesis and voice conversion has raised substantial
concerns due to the potential misuse of such technology, prompting a pressing need for …

Do you remember? overcoming catastrophic forgetting for fake audio detection

X Zhang, J Yi, J Tao, C Wang… - … on Machine Learning, 2023 - proceedings.mlr.press
Current fake audio detection algorithms have achieved promising performances on most
datasets. However, their performance may be significantly degraded when dealing with …

The multi-speaker multi-style voice cloning challenge 2021

Q Xie, X Tian, G Liu, K Song, L Xie, Z Wu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
The Multi-speaker Multi-style Voice Cloning Challenge (M2VoC) aims to provide a common
sizable dataset as well as a fair testbed for the benchmarking of the popular voice cloning …

ASSD: Synthetic Speech Detection in the AAC Compressed Domain

AKS Yadav, Z Xiang, ER Bartusiak… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Synthetic human speech signals have become very easy to generate given modern text-to-
speech methods. When these signals are shared on social media they are often …

An overview of recent work in multimedia forensics

K Bhagtani, AKS Yadav, ER Bartusiak… - 2022 IEEE 5th …, 2022 - computer.org
An Overview of Recent Work in Multimedia Forensics Toggle navigation IEEE Computer
Society Digital Library Jobs Tech News Resource Center Press Room Advertising About Us …

Generalizable zero-shot speaker adaptive speech synthesis with disentangled representations

W Wang, Y Song, S Jha - arXiv preprint arXiv:2308.13007, 2023 - arxiv.org
While most research into speech synthesis has focused on synthesizing high-quality speech
for in-dataset speakers, an equally essential yet unsolved problem is synthesizing speech …

[PDF][PDF] Synthetic speech attribution using self supervised audio spectrogram transformer

AKS Yadav, ER Bartusiak, K Bhagtani… - Electronic …, 2023 - library.imaging.org
The ability to synthesize convincing human speech has become easier due to the
availability of speech generation tools. This necessitates the development of forensics …