查看文章

arxiv.org 中的 [PDF]

Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection

作者

Cai Yu, Shan Jia, Xiaomeng Fu, Jin Liu, Jiahe Tian, Jiao Dai, Xi Wang, Siwei Lyu, Jizhong Han

发表日期

2024/4/30

期刊

arXiv preprint arXiv:2404.19171

简介

With the rising prevalence of deepfakes, there is a growing interest in developing generalizable detection methods for various types of deepfakes. While effective in their specific modalities, traditional detection methods fall short in addressing the generalizability of detection across diverse cross-modal deepfakes. This paper aims to explicitly learn potential cross-modal correlation to enhance deepfake detection towards various generation scenarios. Our approach introduces a correlation distillation task, which models the inherent cross-modal correlation based on content information. This strategy helps to prevent the model from overfitting merely to audio-visual synchronization. Additionally, we present the Cross-Modal Deepfake Dataset (CMDFD), a comprehensive dataset with four generation methods to evaluate the detection of diverse cross-modal deepfakes. The experimental results on CMDFD and FakeAVCeleb datasets demonstrate the superior generalizability of our method over existing state-of-the-art methods. Our code and data can be found at \url{https://github.com/ljj898/CMDFD-Dataset-and-Deepfake-Detection}.

引用总数

被引用次数：1

20241

学术搜索中的文章

Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection

C Yu, S Jia, X Fu, J Liu, J Tian, J Dai, X Wang, S Lyu… - arXiv preprint arXiv:2404.19171, 2024

被引用次数：1 相关文章所有 2 个版本