Learning imbalanced data with vision transformers

Z Xu, R Liu, S Yang, Z Chai… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The real-world data tends to be heavily imbalanced and severely skew the data-driven deep
neural networks, which makes Long-Tailed Recognition (LTR) a massive challenging task …

Symbol-LLM: leverage language models for symbolic system in visual human activity reasoning

X Wu, YL Li, J Sun, C Lu - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Human reasoning can be understood as a cooperation between the intuitive, associative"
System-1''and the deliberative, logical" System-2''. For existing System-1-like methods in …

Meta-causal feature learning for out-of-distribution generalization

Y Wang, X Li, Z Qi, J Li, X Li, X Meng… - European Conference on …, 2022 - Springer
Causal inference has become a powerful tool to handle the out-of-distribution (OOD)
generalization problem, which aims to extract the invariant features. However, conventional …

Mdcs: More diverse experts with consistency self-distillation for long-tailed recognition

Q Zhao, C Jiang, W Hu, F Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recently, multi-expert methods have led to significant improvements in long-tail recognition
(LTR). We summarize two aspects that need further enhancement to contribute to LTR …

Class-level Structural Relation Modeling and Smoothing for Visual Representation Learning

Z Chen, Z Qi, X Cao, X Li, X Meng, L Meng - Proceedings of the 31st …, 2023 - dl.acm.org
Representation learning for images has been advanced by recent progress in more complex
neural models such as the Vision Transformers and new learning theories such as the …

SandGAN: Style-Mix Assisted Noise Distortion for Imbalanced Conditional Image Synthesis

H Liu, Y Endo, J Lee, S Kamijo - Neurocomputing, 2023 - Elsevier
Abstract Conditional Generative Adversarial Networks (CGANs) are well developed on
balanced datasets as default standards for generating high-quality images of expected …

Deep Imbalanced Regression via Hierarchical Classification Adjustment

H Xiong, A Yao - Proceedings of the IEEE/CVF Conference …, 2024 - openaccess.thecvf.com
Regression tasks in computer vision such as age estimation or counting are often formulated
into classification by quantizing the target space into classes. Yet real-world data is often …

SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification

S Huang, B Zhang, B Shi, H Li, Y Li, P Gao - Proceedings of the 31st …, 2023 - dl.acm.org
Although Domain Generalization (DG) problem has been fast-growing in the 2D image
tasks, its exploration on 3D point cloud data is still insufficient and challenged by more …

Distill gold from massive ores: Efficient dataset distillation via critical samples selection

Y Xu, YL Li, K Cui, Z Wang, C Lu, YW Tai… - arXiv preprint arXiv …, 2023 - arxiv.org
Data-efficient learning has garnered significant attention, especially given the current trend
of large multi-modal models. Recently, dataset distillation becomes an effective approach for …

DBN-Mix: Training dual branch network using bilateral mixup augmentation for long-tailed visual recognition

JS Baik, IY Yoon, JW Choi - Pattern Recognition, 2024 - Elsevier
There is growing interest in the challenging visual perception task of learning from long-
tailed class distributions. The extreme class imbalance in the training dataset biases the …