Generalized video anomaly event detection: Systematic taxonomy and comparison of deep models

Y Liu, D Yang, Y Wang, J Liu, J Liu… - ACM Computing …, 2024 - dl.acm.org
Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance
systems, enabling the temporal or spatial identification of anomalous events within videos …

How2comm: Communication-efficient and collaboration-pragmatic multi-agent perception

D Yang, K Yang, Y Wang, J Liu, Z Xu… - Advances in …, 2024 - proceedings.neurips.cc
Multi-agent collaborative perception has recently received widespread attention as an
emerging application in driving scenarios. Despite the advancements in previous efforts …

Robust emotion recognition in context debiasing

D Yang, K Yang, M Li, S Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Context-aware emotion recognition (CAER) has recently boosted the practical applications
of affective computing techniques in unconstrained environments. Mainstream CAER …

What2comm: Towards communication-efficient collaborative perception via feature decoupling

K Yang, D Yang, J Zhang, H Wang, P Sun… - Proceedings of the 31st …, 2023 - dl.acm.org
Multi-agent collaborative perception has received increasing attention recently as an
emerging application in driving scenarios. Despite advancements in previous approaches …

Learning causality-inspired representation consistency for video anomaly detection

Y Liu, Z Xia, M Zhao, D Wei, Y Wang, S Liu… - Proceedings of the 31st …, 2023 - dl.acm.org
Video anomaly detection is an essential yet challenging task in the multimedia community,
with promising applications in smart cities and secure communities. Existing methods …

Efficient decision-based black-box patch attacks on video recognition

K Jiang, Z Chen, H Huang, J Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Although Deep Neural Networks (DNNs) have demonstrated excellent
performance, they are vulnerable to adversarial patches that introduce perceptible and …

Sampling to distill: Knowledge transfer from open-world data

Y Wang, Z Chen, J Zhang, D Yang, Z Ge, Y Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Data-Free Knowledge Distillation (DFKD) is a novel task that aims to train high-performance
student models using only the teacher network without original training data. Despite …

Efficiency in focus: Layernorm as a catalyst for fine-tuning medical visual language pre-trained models

J Chen, D Yang, Y Jiang, M Li, J Wei, X Hou… - arXiv preprint arXiv …, 2024 - arxiv.org
In the realm of Medical Visual Language Models (Med-VLMs), the quest for universal
efficient fine-tuning mechanisms remains paramount, especially given researchers in …

Towards Multimodal Human Intention Understanding Debiasing via Subject-Deconfounding

D Yang, D Xiao, K Li, Y Wang, Z Chen, J Wei… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal intention understanding (MIU) is an indispensable component of human
expression analysis (eg, sentiment or humor) from heterogeneous modalities, including …

Towards multimodal sentiment analysis debiasing via bias purification

D Yang, M Li, D Xiao, Y Liu, K Yang, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Sentiment Analysis (MSA) aims to understand human intentions by integrating
emotion-related clues from diverse modalities, such as visual, language, and audio …