Improving pixel-based mim by reducing wasted modeling capability

Y Liu, S Zhang, J Chen, Z Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com
There has been significant progress in Masked Image Modeling (MIM). Existing MIM
methods can be broadly categorized into two groups based on the reconstruction target …

Pixmim: Rethinking pixel reconstruction in masked image modeling

Y Liu, S Zhang, J Chen, K Chen, D Lin - arXiv preprint arXiv:2303.02416, 2023 - arxiv.org
Masked Image Modeling (MIM) has achieved promising progress with the advent of Masked
Autoencoders (MAE) and BEiT. However, subsequent works have complicated the …

Controllable augmentations for video representation learning

R Qian, W Lin, J See, D Li - Visual Intelligence, 2024 - Springer
This paper focuses on self-supervised video representation learning. Most existing
approaches follow the contrastive learning pipeline to construct positive and negative pairs …