Zhixi Cai
PhD Student at Monash University
Verified email at monash.edu
Title
Cited by
Year
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg
Z Cai, S Ghosh, K Stefanov, A Dhall, J Cai, H Rezatofighi, R Haffari, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 44
2023
Do you really mean that? Content driven audio-visual deepfake dataset and multimodal method for temporal forgery localization
Z Cai, K Stefanov, A Dhall, M Hayat
2022 International Conference on Digital Image Computing: Techniques and …, 2022
Cited by 29
2022
Glitch in the matrix: A large scale benchmark for content driven audio–visual forgery detection and localization
Z Cai, S Ghosh, A Dhall, T Gedeon, K Stefanov, M Hayat
Computer Vision and Image Understanding 236, 103818, 2023
Cited by 8
2023
AV-Deepfake1M: A large-scale LLM-driven audio-visual deepfake dataset
Z Cai, S Ghosh, AP Adatia, M Hayat, A Dhall, K Stefanov
arXiv preprint arXiv:2311.15308, 2023
Cited by 6
2023
Emolysis: A multimodal open-source group emotion analysis and visualization toolkit
S Ghosh, Z Cai, P Gupta, G Sharma, A Dhall, M Hayat, T Gedeon
arXiv preprint arXiv:2305.05255, 2023
Cited by 1
2023
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
F Ke, Z Cai, S Jahangard, W Wang, PD Haghighi, H Rezatofighi
arXiv preprint arXiv:2403.12884, 2024
2024
JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups
S Jahangard, Z Cai, S Wen, H Rezatofighi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase
S Ghosh, R Hasan, P Agrawal, Z Cai, S Soon, A Dhall, T Gedeon
arXiv preprint arXiv:2305.06110, 2023
2023