Prosody and voice factorization for few-shot speaker adaptation in the challenge m2voc 2021

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

被引用次数：44 相关文章所有 4 个版本

[PDF] arxiv.org

Add 2022: the first audio deep synthesis detection challenge

J Yi, R Fu, J Tao, S Nie, H Ma, C Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Audio deepfake detection is an emerging topic, which was included in the ASVspoof 2021.
However, the recent shared tasks have not covered many real-life and challenging …

被引用次数：183 相关文章所有 9 个版本

[PDF] arxiv.org

An overview of recent work in media forensics: Methods and threats

K Bhagtani, AKS Yadav, ER Bartusiak, Z Xiang… - arXiv preprint arXiv …, 2022 - arxiv.org

In this paper, we review recent work in media forensics for digital images, video, audio
(specifically speech), and documents. For each data modality, we discuss synthesis and …

被引用次数：31 相关文章所有 2 个版本

[PDF] aaai.org

What to remember: Self-adaptive continual learning for audio deepfake detection

X Zhang, J Yi, C Wang, CY Zhang, S Zeng… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

The rapid evolution of speech synthesis and voice conversion has raised substantial
concerns due to the potential misuse of such technology, prompting a pressing need for …

被引用次数：12 相关文章所有 5 个版本

[PDF] mlr.press

Do you remember? overcoming catastrophic forgetting for fake audio detection

X Zhang, J Yi, J Tao, C Wang… - … on Machine Learning, 2023 - proceedings.mlr.press

Current fake audio detection algorithms have achieved promising performances on most
datasets. However, their performance may be significantly degraded when dealing with …

被引用次数：18 相关文章所有 7 个版本

[PDF] arxiv.org

The multi-speaker multi-style voice cloning challenge 2021

Q Xie, X Tian, G Liu, K Song, L Xie, Z Wu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

The Multi-speaker Multi-style Voice Cloning Challenge (M2VoC) aims to provide a common
sizable dataset as well as a fair testbed for the benchmarking of the popular voice cloning …

被引用次数：39 相关文章所有 3 个版本

[PDF] sigport.org

ASSD: Synthetic Speech Detection in the AAC Compressed Domain

AKS Yadav, Z Xiang, ER Bartusiak… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Synthetic human speech signals have become very easy to generate given modern text-to-
speech methods. When these signals are shared on social media they are often …

被引用次数：12 相关文章所有 2 个版本

An overview of recent work in multimedia forensics

K Bhagtani, AKS Yadav, ER Bartusiak… - 2022 IEEE 5th …, 2022 - computer.org

An Overview of Recent Work in Multimedia Forensics Toggle navigation IEEE Computer
Society Digital Library Jobs Tech News Resource Center Press Room Advertising About Us …

被引用次数：19 相关文章所有 2 个版本

[PDF] arxiv.org

Generalizable zero-shot speaker adaptive speech synthesis with disentangled representations

W Wang, Y Song, S Jha - arXiv preprint arXiv:2308.13007, 2023 - arxiv.org

While most research into speech synthesis has focused on synthesizing high-quality speech
for in-dataset speakers, an equally essential yet unsolved problem is synthesizing speech …

被引用次数：6 相关文章所有 4 个版本

[PDF] imaging.org

[PDF][PDF] Synthetic speech attribution using self supervised audio spectrogram transformer

AKS Yadav, ER Bartusiak, K Bhagtani… - Electronic …, 2023 - library.imaging.org

The ability to synthesize convincing human speech has become easier due to the
availability of speech generation tools. This necessitates the development of forensics …

被引用次数：17 相关文章

高级搜索

QQ 群