Tencent ml-images: A large-scale multi-label image database for visual representation learning

B Wu, W Chen, Y Fan, Y Zhang, J Hou, J Liu… - IEEE …, 2019 - ieeexplore.ieee.org
In existing visual representation learning tasks, deep convolutional neural networks (CNNs)
are often trained on images annotated with single tag, such as ImageNet. However, single …

Co2: Consistent contrast for unsupervised visual representation learning

C Wei, H Wang, W Shen, A Yuille - arXiv preprint arXiv:2010.02217, 2020 - arxiv.org
Contrastive learning has been adopted as a core method for unsupervised visual
representation learning. Without human annotation, the common practice is to perform an …

A simple framework for contrastive learning of visual representations

T Chen, S Kornblith, M Norouzi… - … conference on machine …, 2020 - proceedings.mlr.press
This paper presents SimCLR: a simple framework for contrastive learning of visual
representations. We simplify recently proposed contrastive self-supervised learning …

Object-centric representation learning from unlabeled videos

R Gao, D Jayaraman, K Grauman - … 20-24, 2016, Revised Selected Papers …, 2017 - Springer
Supervised (pre-) training currently yields state-of-the-art performance for representation
learning for visual recognition, yet it comes at the cost of (1) intensive manual annotations …

Difficulty-based sampling for debiased contrastive representation learning

T Jang, X Wang - Proceedings of the IEEE/CVF Conference …, 2023 - openaccess.thecvf.com
Contrastive learning is a self-supervised representation learning method that achieves
milestone performance in various classification tasks. However, due to its unsupervised …

Network deconvolution

C Ye, M Evanusa, H He, A Mitrokhin… - arXiv preprint arXiv …, 2019 - arxiv.org
Convolution is a central operation in Convolutional Neural Networks (CNNs), which applies
a kernel to overlapping regions shifted across the image. However, because of the strong …

Self-supervised representation learning by predicting visual permutations

Q Zhao, J Dong - Knowledge-Based Systems, 2020 - Elsevier
We propose a self-supervised learning method to uncover the spatial or temporal structure
of visual data by identifying the position of a patch within an image or the position of a video …

Lightweight pixel difference networks for efficient visual representation learning

Z Su, J Zhang, L Wang, H Zhang, Z Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Recently, there have been tremendous efforts in developing lightweight Deep Neural
Networks (DNNs) with satisfactory accuracy, which can enable the ubiquitous deployment of …

Unsupervised object-level representation learning from scene images

J Xie, X Zhan, Z Liu, YS Ong… - Advances in Neural …, 2021 - proceedings.neurips.cc
Contrastive self-supervised learning has largely narrowed the gap to supervised pre-training
on ImageNet. However, its success highly relies on the object-centric priors of ImageNet, ie …

Transfer learning in computer vision tasks: Remember where you come from

X Li, Y Grandvalet, F Davoine, J Cheng, Y Cui… - Image and Vision …, 2020 - Elsevier
Fine-tuning pre-trained deep networks is a practical way of benefiting from the
representation learned on a large database while having relatively few examples to train a …