Title | Authors | Venue | Cited by | Year |
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg | Z Cai, S Ghosh, K Stefanov, A Dhall, J Cai, H Rezatofighi, R Haffari, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 44 | 2023 |
Do you really mean that? Content driven audio-visual deepfake dataset and multimodal method for temporal forgery localization | Z Cai, K Stefanov, A Dhall, M Hayat | 2022 International Conference on Digital Image Computing: Techniques and …, 2022 | 29 | 2022 |
Glitch in the matrix: A large scale benchmark for content driven audio–visual forgery detection and localization | Z Cai, S Ghosh, A Dhall, T Gedeon, K Stefanov, M Hayat | Computer Vision and Image Understanding 236, 103818, 2023 | 8 | 2023 |
AV-Deepfake1M: A large-scale LLM-driven audio-visual deepfake dataset | Z Cai, S Ghosh, AP Adatia, M Hayat, A Dhall, K Stefanov | arXiv preprint arXiv:2311.15308, 2023 | 6 | 2023 |
Emolysis: A multimodal open-source group emotion analysis and visualization toolkit | S Ghosh, Z Cai, P Gupta, G Sharma, A Dhall, M Hayat, T Gedeon | arXiv preprint arXiv:2305.05255, 2023 | 1 | 2023 |
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning | F Ke, Z Cai, S Jahangard, W Wang, PD Haghighi, H Rezatofighi | arXiv preprint arXiv:2403.12884, 2024 | | 2024 |
JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups | S Jahangard, Z Cai, S Wen, H Rezatofighi | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase | S Ghosh, R Hasan, P Agrawal, Z Cai, S Soon, A Dhall, T Gedeon | arXiv preprint arXiv:2305.06110, 2023 | | 2023 |