Asvspoof 2021: Towards spoofed and deepfake speech detection in the wild

X Liu, X Wang, M Sahidullah, J Patino… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Benchmarking initiatives support the meaningful comparison of competing solutions to
prominent problems in speech and language processing. Successive benchmarking …

Does audio deepfake detection generalize?

NM Müller, P Czempin, F Dieckmann… - arXiv preprint arXiv …, 2022 - arxiv.org
Current text-to-speech algorithms produce realistic fakes of human voices, making deepfake
detection a much-needed area of research. While researchers have presented various …

Human perception of audio deepfakes

NM Müller, K Pizzi, J Williams - … of the 1st International Workshop on …, 2022 - dl.acm.org
The recent emergence of deepfakes has brought manipulated and generated content to the
forefront of machine learning research. Automatic detection of deepfakes has seen many …

A place for (socio) linguistics in audio deepfake detection and discernment: Opportunities for convergence and interdisciplinary collaboration

C Mallinson, VP Janeja, C Evered… - Language and …, 2024 - Wiley Online Library
Deepfakes, particularly audio deepfakes, have become pervasive and pose unique, ever‐
changing threats to society. This paper reviews the current research landscape on audio …

A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection

L Pham, P Lam, T Nguyen, H Tang, D Tran… - arXiv preprint arXiv …, 2024 - arxiv.org
Thanks to advancements in deep learning, speech generation systems now power a variety
of real-world applications, such as text-to-speech for individuals with speech disorders …

The impact of silence on speech anti-spoofing

Y Zhang, Z Li, J Lu, H Hua, W Wang… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
The current speech anti-spoofing countermeasures (CMs) show excellent performance on
specific datasets. However, removing the silence of test speech through Voice Activity …

TIMIT-TTS: A text-to-speech dataset for multimodal synthetic media detection

D Salvi, B Hosler, P Bestagini, MC Stamm… - IEEE …, 2023 - ieeexplore.ieee.org
With the rapid development of deep learning techniques, the generation and counterfeiting
of multimedia material has become increasingly simple. Current technology enables the …

Explaining deep learning models for spoofing and deepfake detection with SHapley Additive exPlanations

W Ge, J Patino, M Todisco… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Substantial progress in spoofing and deepfake detection has been made in recent years.
Nonetheless, the community has yet to make notable inroads in providing an explanation for …

Mlaad: The multi-language audio anti-spoofing dataset

NM Müller, P Kawa, WH Choong, E Casanova… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-Speech (TTS) technology brings significant advantages, such as giving a voice to
those with speech impairments, but also enables audio deepfakes and spoofs. The former …

Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders

X Wang, J Yamagishi - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
A good training set for speech spoofing countermeasures requires diverse TTS and VC
spoofing attacks, but generating TTS and VC spoofed trials for a target speaker may be …