Stablerep: Synthetic images from text-to-image models make strong visual representation learners

Y Tian, L Fan, P Isola, H Chang… - Advances in Neural …, 2024 - proceedings.neurips.cc
We investigate the potential of learning visual representations using synthetic images
generated by text-to-image models. This is a natural question in the light of the excellent …

Scaling laws of synthetic images for model training... for now

L Fan, K Chen, D Krishnan, D Katabi… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent significant advances in text-to-image models unlock the possibility of training vision
systems using synthetic images potentially overcoming the difficulty of collecting curated …

Improving multimodal datasets with image captioning

T Nguyen, SY Gadre, G Ilharco… - Advances in Neural …, 2024 - proceedings.neurips.cc
Massive web datasets play a key role in the success of large vision-language models like
CLIP and Flamingo. However, the raw web data is noisy, and existing filtering methods to …

Learning vision from models rivals learning vision from data

Y Tian, L Fan, K Chen, D Katabi… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce SynCLR a novel approach for learning visual representations exclusively from
synthetic images without any real data. We synthesize a large dataset of image captions …

Dream the impossible: Outlier imagination with diffusion models

X Du, Y Sun, J Zhu, Y Li - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Utilizing auxiliary outlier datasets to regularize the machine learning model has
demonstrated promise for out-of-distribution (OOD) detection and safe prediction. Due to the …

Self-consuming generative models go mad

S Alemohammad, J Casco-Rodriguez, L Luzi… - arXiv preprint arXiv …, 2023 - arxiv.org
Seismic advances in generative AI algorithms for imagery, text, and other data types has led
to the temptation to use synthetic data to train next-generation models. Repeating this …

Dataset diffusion: Diffusion-based synthetic data generation for pixel-level semantic segmentation

Q Nguyen, T Vu, A Tran… - Advances in Neural …, 2024 - proceedings.neurips.cc
Preparing training data for deep vision models is a labor-intensive task. To address this,
generative models have emerged as an effective solution for generating synthetic data …

Freemask: Synthetic images with dense annotations make stronger segmentation models

L Yang, X Xu, B Kang, Y Shi… - Advances in Neural …, 2024 - proceedings.neurips.cc
Semantic segmentation has witnessed tremendous progress due to the proposal of various
advanced network architectures. However, they are extremely hungry for delicate …

Dreamsim: Learning new dimensions of human visual similarity using synthetic data

S Fu, N Tamir, S Sundaram, L Chai, R Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Current perceptual similarity metrics operate at the level of pixels and patches. These
metrics compare images in terms of their low-level colors and textures, but fail to capture mid …

Leaving reality to imagination: Robust classification via generated datasets

H Bansal, A Grover - arXiv preprint arXiv:2302.02503, 2023 - arxiv.org
Recent research on robustness has revealed significant performance gaps between neural
image classifiers trained on datasets that are similar to the test set, and those that are from a …