Probabilistic contrastive learning for long-tailed visual recognition

C Du, Y Wang, S Song, G Huang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Long-tailed distributions frequently emerge in real-world data, where a large number of
minority categories contain a limited number of samples. Such imbalance issue …

Balanced knowledge distillation for long-tailed learning

S Zhang, C Chen, X Hu, S Peng - Neurocomputing, 2023 - Elsevier
Deep models trained on long-tailed datasets exhibit unsatisfactory performance on tail
classes. Existing methods usually modify the classification loss to increase the learning …

De-biasing distantly supervised named entity recognition via causal intervention

W Zhang, H Lin, X Han, L Sun - arXiv preprint arXiv:2106.09233, 2021 - arxiv.org
Distant supervision tackles the data bottleneck in NER by automatically generating training
instances via dictionary matching. Unfortunately, the learning of DS-NER is severely …

Deconfounded visual grounding

J Huang, Y Qin, J Qi, Q Sun, H Zhang - Proceedings of the AAAI …, 2022 - ojs.aaai.org
We focus on the confounding bias between language and location in the visual grounding
pipeline, where we find that the bias is the major visual reasoning bottleneck. For example …

Videolt: Large-scale long-tailed video recognition

X Zhang, Z Wu, Z Weng, H Fu, J Chen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Label distributions in real-world are oftentimes long-tailed and imbalanced, resulting in
biased models towards dominant labels. While long-tailed recognition has been extensively …

Causal unsupervised semantic segmentation

J Kim, BK Lee, YM Ro - arXiv preprint arXiv:2310.07379, 2023 - arxiv.org
Unsupervised semantic segmentation aims to achieve high-quality semantic grouping
without human-labeled annotations. With the advent of self-supervised pre-training, various …

Contextual debiasing for visual recognition with causal mechanisms

R Liu, H Liu, G Li, H Hou, TH Yu… - Proceedings of the …, 2022 - openaccess.thecvf.com
As a common problem in the visual world, contextual bias means the recognition may
depend on the co-occurrence context rather than the objects themselves, which is even …

Visual causal scene refinement for video question answering

Y Wei, Y Liu, H Yan, G Li, L Lin - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Existing methods for video question answering (VideoQA) often suffer from spurious
correlations between different modalities, leading to a failure in identifying the dominant …

A simple long-tailed recognition baseline via vision-language model

T Ma, S Geng, M Wang, J Shao, J Lu, H Li… - arXiv preprint arXiv …, 2021 - arxiv.org
The visual world naturally exhibits a long-tailed distribution of open classes, which poses
great challenges to modern visual systems. Existing approaches either perform class re …

Mosaicos: a simple and effective use of object-centric images for long-tailed object detection

C Zhang, TY Pan, Y Li, H Hu, D Xuan… - Proceedings of the …, 2021 - openaccess.thecvf.com
Many objects do not appear frequently enough in complex scenes (eg, certain handbags in
living rooms) for training an accurate object detector, but are often found frequently by …