Towards bridging event captioner and sentence localizer for weakly supervised dense event captioning

S Chen, YG Jiang - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
Abstract Dense Event Captioning (DEC) aims to jointly localize and describe multiple events
of interest in untrimmed videos, which is an advancement of the conventional video …

Certified robustness against natural language attacks by causal intervention

H Zhao, C Ma, X Dong, AT Luu… - International …, 2022 - proceedings.mlr.press
Deep learning models have achieved great success in many fields, yet they are vulnerable
to adversarial examples. This paper follows a causal perspective to look into the adversarial …

Imbsam: A closer look at sharpness-aware minimization in class-imbalanced recognition

Y Zhou, Y Qu, X Xu, H Shen - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Class imbalance is a common challenge in real-world recognition tasks, where the majority
of classes have few samples, also known as tail classes. We address this challenge with the …

A re-balancing strategy for class-imbalanced classification based on instance difficulty

S Yu, J Guo, R Zhang, Y Fan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Real-world data often exhibits class-imbalanced distributions, where a few classes (aka
majority classes) occupy most instances and lots of classes (aka minority classes) have few …

Exploring vision-language models for imbalanced learning

Y Wang, Z Yu, J Wang, Q Heng, H Chen, W Ye… - International Journal of …, 2024 - Springer
Vision-language models (VLMs) that use contrastive language-image pre-training have
shown promising zero-shot classification performance. However, their performance on …

Learning imbalanced data with vision transformers

Z Xu, R Liu, S Yang, Z Chai… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The real-world data tends to be heavily imbalanced and severely skew the data-driven deep
neural networks, which makes Long-Tailed Recognition (LTR) a massive challenging task …

Backdoor defense via deconfounded representation learning

Z Zhang, Q Liu, Z Wang, Z Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Deep neural networks (DNNs) are recently shown to be vulnerable to backdoor attacks,
where attackers embed hidden backdoors in the DNN model by injecting a few poisoned …

On model calibration for long-tailed object detection and instance segmentation

TY Pan, C Zhang, Y Li, H Hu, D Xuan… - Advances in …, 2021 - proceedings.neurips.cc
Vanilla models for object detection and instance segmentation suffer from the heavy bias
toward detecting frequent objects in the long-tailed setting. Existing methods address this …

Generalized logit adjustment: Calibrating fine-tuned models by removing label bias in foundation models

B Zhu, K Tang, Q Sun, H Zhang - Advances in Neural …, 2024 - proceedings.neurips.cc
Foundation models like CLIP allow zero-shot transfer on various tasks without additional
training data. Yet, the zero-shot performance is less competitive than a fully supervised one …

Forest r-cnn: Large-vocabulary long-tailed object detection and instance segmentation

J Wu, L Song, T Wang, Q Zhang, J Yuan - Proceedings of the 28th ACM …, 2020 - dl.acm.org
Despite the previous success of object analysis, detecting and segmenting a large number
of object categories with a long-tailed data distribution remains a challenging problem and is …