Aide: A vision-driven multi-view, multi-modal, multi-tasking dataset for assistive driving...

Y Liu, D Yang, Y Wang, J Liu, J Liu… - ACM Computing …, 2024 - dl.acm.org

Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance
systems, enabling the temporal or spatial identification of anomalous events within videos …

被引用次数：42 相关文章所有 3 个版本

[PDF] neurips.cc

How2comm: Communication-efficient and collaboration-pragmatic multi-agent perception

D Yang, K Yang, Y Wang, J Liu, Z Xu… - Advances in …, 2024 - proceedings.neurips.cc

Multi-agent collaborative perception has recently received widespread attention as an
emerging application in driving scenarios. Despite the advancements in previous efforts …

被引用次数：15 相关文章所有 5 个版本

[PDF] thecvf.com

Robust emotion recognition in context debiasing

D Yang, K Yang, M Li, S Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Context-aware emotion recognition (CAER) has recently boosted the practical applications
of affective computing techniques in unconstrained environments. Mainstream CAER …

被引用次数：6 相关文章所有 4 个版本

What2comm: Towards communication-efficient collaborative perception via feature decoupling

K Yang, D Yang, J Zhang, H Wang, P Sun… - Proceedings of the 31st …, 2023 - dl.acm.org

Multi-agent collaborative perception has received increasing attention recently as an
emerging application in driving scenarios. Despite advancements in previous approaches …

被引用次数：18 相关文章

[PDF] arxiv.org

Learning causality-inspired representation consistency for video anomaly detection

Y Liu, Z Xia, M Zhao, D Wei, Y Wang, S Liu… - Proceedings of the 31st …, 2023 - dl.acm.org

Video anomaly detection is an essential yet challenging task in the multimedia community,
with promising applications in smart cities and secure communities. Existing methods …

被引用次数：10 相关文章所有 3 个版本

[PDF] thecvf.com

Efficient decision-based black-box patch attacks on video recognition

K Jiang, Z Chen, H Huang, J Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Although Deep Neural Networks (DNNs) have demonstrated excellent
performance, they are vulnerable to adversarial patches that introduce perceptible and …

被引用次数：11 相关文章所有 5 个版本

[PDF] arxiv.org

Sampling to distill: Knowledge transfer from open-world data

Y Wang, Z Chen, J Zhang, D Yang, Z Ge, Y Liu… - arXiv preprint arXiv …, 2023 - arxiv.org

Data-Free Knowledge Distillation (DFKD) is a novel task that aims to train high-performance
student models using only the teacher network without original training data. Despite …

被引用次数：9 相关文章所有 3 个版本

[PDF] arxiv.org

Efficiency in focus: Layernorm as a catalyst for fine-tuning medical visual language pre-trained models

J Chen, D Yang, Y Jiang, M Li, J Wei, X Hou… - arXiv preprint arXiv …, 2024 - arxiv.org

In the realm of Medical Visual Language Models (Med-VLMs), the quest for universal
efficient fine-tuning mechanisms remains paramount, especially given researchers in …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Towards Multimodal Human Intention Understanding Debiasing via Subject-Deconfounding

D Yang, D Xiao, K Li, Y Wang, Z Chen, J Wei… - arXiv preprint arXiv …, 2024 - arxiv.org

Multimodal intention understanding (MIU) is an indispensable component of human
expression analysis (eg, sentiment or humor) from heterogeneous modalities, including …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Towards multimodal sentiment analysis debiasing via bias purification

D Yang, M Li, D Xiao, Y Liu, K Yang, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Multimodal Sentiment Analysis (MSA) aims to understand human intentions by integrating
emotion-related clues from diverse modalities, such as visual, language, and audio …

被引用次数：4 相关文章所有 3 个版本

高级搜索

QQ 群