Speech is silver, silence is golden: What do ASVspoof-trained models really learn?

X Liu, X Wang, M Sahidullah, J Patino… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

Benchmarking initiatives support the meaningful comparison of competing solutions to
prominent problems in speech and language processing. Successive benchmarking …

被引用次数：184 相关文章所有 6 个版本

[PDF] arxiv.org

Does audio deepfake detection generalize?

NM Müller, P Czempin, F Dieckmann… - arXiv preprint arXiv …, 2022 - arxiv.org

Current text-to-speech algorithms produce realistic fakes of human voices, making deepfake
detection a much-needed area of research. While researchers have presented various …

被引用次数：171 相关文章所有 8 个版本

[PDF] acm.org

Human perception of audio deepfakes

NM Müller, K Pizzi, J Williams - … of the 1st International Workshop on …, 2022 - dl.acm.org

The recent emergence of deepfakes has brought manipulated and generated content to the
forefront of machine learning research. Automatic detection of deepfakes has seen many …

被引用次数：74 相关文章所有 6 个版本

A place for (socio) linguistics in audio deepfake detection and discernment: Opportunities for convergence and interdisciplinary collaboration

C Mallinson, VP Janeja, C Evered… - Language and …, 2024 - Wiley Online Library

Deepfakes, particularly audio deepfakes, have become pervasive and pose unique, ever‐
changing threats to society. This paper reviews the current research landscape on audio …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection

L Pham, P Lam, T Nguyen, H Tang, D Tran… - arXiv preprint arXiv …, 2024 - arxiv.org

Thanks to advancements in deep learning, speech generation systems now power a variety
of real-world applications, such as text-to-speech for individuals with speech disorders …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

The impact of silence on speech anti-spoofing

Y Zhang, Z Li, J Lu, H Hua, W Wang… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org

The current speech anti-spoofing countermeasures (CMs) show excellent performance on
specific datasets. However, removing the silence of test speech through Voice Activity …

被引用次数：14 相关文章所有 4 个版本

[PDF] ieee.org

TIMIT-TTS: A text-to-speech dataset for multimodal synthetic media detection

D Salvi, B Hosler, P Bestagini, MC Stamm… - IEEE …, 2023 - ieeexplore.ieee.org

With the rapid development of deep learning techniques, the generation and counterfeiting
of multimedia material has become increasingly simple. Current technology enables the …

被引用次数：35 相关文章所有 8 个版本

[PDF] arxiv.org

Explaining deep learning models for spoofing and deepfake detection with SHapley Additive exPlanations

W Ge, J Patino, M Todisco… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Substantial progress in spoofing and deepfake detection has been made in recent years.
Nonetheless, the community has yet to make notable inroads in providing an explanation for …

被引用次数：42 相关文章所有 5 个版本

[PDF] arxiv.org

Mlaad: The multi-language audio anti-spoofing dataset

NM Müller, P Kawa, WH Choong, E Casanova… - arXiv preprint arXiv …, 2024 - arxiv.org

Text-to-Speech (TTS) technology brings significant advantages, such as giving a voice to
those with speech impairments, but also enables audio deepfakes and spoofs. The former …

被引用次数：35 相关文章所有 2 个版本

[PDF] arxiv.org

Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders

X Wang, J Yamagishi - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

A good training set for speech spoofing countermeasures requires diverse TTS and VC
spoofing attacks, but generating TTS and VC spoofed trials for a target speaker may be …

被引用次数：37 相关文章所有 3 个版本

高级搜索

QQ 群