Learning vision from models rivals learning vision from data

Y Tian, L Fan, K Chen, D Katabi… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce SynCLR a novel approach for learning visual representations exclusively from
synthetic images without any real data. We synthesize a large dataset of image captions …

A comprehensive survey for generative data augmentation

Y Chen, Z Yan, Y Zhu - Neurocomputing, 2024 - Elsevier
Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

Expanding small-scale datasets with guided imagination

Y Zhang, D Zhou, B Hooi, K Wang… - Advances in neural …, 2023 - proceedings.neurips.cc
The power of DNNs relies heavily on the quantity and quality of training data. However,
collecting and annotating data on a large scale is often expensive and time-consuming. To …

Advances in diffusion models for image data augmentation: A review of methods, models, evaluation metrics and future research directions

P Alimisis, I Mademlis, P Radoglou-Grammatikis… - arXiv preprint arXiv …, 2024 - arxiv.org
Image data augmentation constitutes a critical methodology in modern computer vision
tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; …

Efficient dataset distillation via minimax diffusion

J Gu, S Vahidian, V Kungurtsev… - Proceedings of the …, 2024 - openaccess.thecvf.com
Dataset distillation reduces the storage and computational consumption of training a
network by generating a small surrogate dataset that encapsulates rich information of the …

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

HAAK Hammoud, H Itani, F Pizzati, P Torr… - arXiv preprint arXiv …, 2024 - arxiv.org
We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic
text-image pairs, significantly departing from previous methods relying on real data …

Data augmentation for object detection via controllable diffusion models

H Fang, B Han, S Zhang, S Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Data augmentation is vital for object detection tasks that require expensive bounding box
annotations. Recent successes in diffusion models have inspired the use of diffusion-based …

Muffin or chihuahua? challenging multimodal large language models with multipanel vqa

Y Fan, J Gu, K Zhou, Q Yan, S Jiang… - Proceedings of the …, 2024 - aclanthology.org
Multipanel images, commonly seen as web screenshots, posters, etc., pervade our daily
lives. These images, characterized by their composition of multiple subfigures in distinct …

Active generation for image classification

T Huang, J Liu, S You, C Xu - European Conference on Computer Vision, 2025 - Springer
Recently, the growing capabilities of deep generative models have underscored their
potential in enhancing image classification accuracy. However, existing methods often …

Real-fake: Effective training data synthesis through distribution matching

J Yuan, J Zhang, S Sun, P Torr, B Zhao - arXiv preprint arXiv:2310.10402, 2023 - arxiv.org
Synthetic training data has gained prominence in numerous learning tasks and scenarios,
offering advantages such as dataset augmentation, generalization evaluation, and privacy …