Navigating the pitfalls of applying machine learning in genomics

S Whalen, J Schreiber, WS Noble… - Nature Reviews Genetics, 2022 - nature.com
The scale of genetic, epigenomic, transcriptomic, cheminformatic and proteomic data
available today, coupled with easy-to-use machine learning (ML) toolkits, has propelled the …

Imbalance problems in object detection: A review

K Oksuz, BC Cam, S Kalkan… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
In this paper, we present a comprehensive review of the imbalance problems in object
detection. To analyze the problems in a systematic manner, we introduce a problem-based …

Deep long-tailed learning: A survey

Y Zhang, B Kang, B Hooi, S Yan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep long-tailed learning, one of the most challenging problems in visual recognition, aims
to train well-performing deep models from a large number of images that follow a long-tailed …

Asymmetric loss for multi-label classification

T Ridnik, E Ben-Baruch, N Zamir… - Proceedings of the …, 2021 - openaccess.thecvf.com
In a typical multi-label setting, a picture contains on average few positive labels, and many
negative ones. This positive-negative imbalance dominates the optimization process, and …

Balanced contrastive learning for long-tailed visual recognition

J Zhu, Z Wang, J Chen, YPP Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Real-world data typically follow a long-tailed distribution, where a few majority categories
occupy most of the data while most minority categories contain a limited number of samples …

Simple copy-paste is a strong data augmentation method for instance segmentation

G Ghiasi, Y Cui, A Srinivas, R Qian… - Proceedings of the …, 2021 - openaccess.thecvf.com
Building instance segmentation models that are data-efficient and can handle rare object
categories is an important challenge in computer vision. Leveraging data augmentations is a …

Parametric contrastive learning

J Cui, Z Zhong, S Liu, B Yu… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we propose Parametric Contrastive Learning (PaCo) to tackle long-tailed
recognition. Based on theoretical analysis, we observe supervised contrastive loss tends to …

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Long-tailed recognition via weight balancing

S Alshammari, YX Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In the real open world, data tends to follow long-tailed class distributions, motivating the well-
studied long-tailed recognition (LTR) problem. Naive training produces models that are …

Spot-the-difference self-supervised pre-training for anomaly detection and segmentation

Y Zou, J Jeong, L Pemula, D Zhang… - European Conference on …, 2022 - Springer
Visual anomaly detection is commonly used in industrial quality inspection. In this paper, we
present a new dataset as well as a new self-supervised learning method for ImageNet pre …