- Academic resource search

Learning vision from models rivals learning vision from data

Y Tian, L Fan, K Chen, D Katabi… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce SynCLR, a novel approach for learning visual representations exclusively from
synthetic images without any real data. We synthesize a large dataset of image captions …

Cited by 42 · Related articles · All 3 versions

A comprehensive survey for generative data augmentation

Y Chen, Z Yan, Y Zhu - Neurocomputing, 2024 - Elsevier

Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

Cited by 4 · Related articles · All 2 versions

Expanding small-scale datasets with guided imagination

Y Zhang, D Zhou, B Hooi, K Wang… - Advances in neural …, 2023 - proceedings.neurips.cc

The power of DNNs relies heavily on the quantity and quality of training data. However,
collecting and annotating data on a large scale is often expensive and time-consuming. To …

Cited by 44 · Related articles · All 6 versions

Advances in diffusion models for image data augmentation: A review of methods, models, evaluation metrics and future research directions

P Alimisis, I Mademlis, P Radoglou-Grammatikis… - arXiv preprint arXiv …, 2024 - arxiv.org

Image data augmentation constitutes a critical methodology in modern computer vision
tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; …

Cited by 3 · Related articles · All 3 versions

Efficient dataset distillation via minimax diffusion

J Gu, S Vahidian, V Kungurtsev… - Proceedings of the …, 2024 - openaccess.thecvf.com

Dataset distillation reduces the storage and computational consumption of training a
network by generating a small surrogate dataset that encapsulates rich information of the …

Cited by 19 · Related articles · All 3 versions

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

HAAK Hammoud, H Itani, F Pizzati, P Torr… - arXiv preprint arXiv …, 2024 - arxiv.org

We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic
text-image pairs, significantly departing from previous methods relying on real data …

Cited by 34 · Related articles · All 4 versions

Data augmentation for object detection via controllable diffusion models

H Fang, B Han, S Zhang, S Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com

Data augmentation is vital for object detection tasks that require expensive bounding box
annotations. Recent successes in diffusion models have inspired the use of diffusion-based …

Cited by 35 · Related articles · All 4 versions

Muffin or chihuahua? challenging multimodal large language models with multipanel vqa

Y Fan, J Gu, K Zhou, Q Yan, S Jiang… - Proceedings of the …, 2024 - aclanthology.org

Multipanel images, commonly seen as web screenshots, posters, etc., pervade our daily
lives. These images, characterized by their composition of multiple subfigures in distinct …

Cited by 6 · Related articles · All 2 versions

Active generation for image classification

T Huang, J Liu, S You, C Xu - European Conference on Computer Vision, 2025 - Springer

Recently, the growing capabilities of deep generative models have underscored their
potential in enhancing image classification accuracy. However, existing methods often …

Cited by 4 · Related articles · All 2 versions

Real-fake: Effective training data synthesis through distribution matching

J Yuan, J Zhang, S Sun, P Torr, B Zhao - arXiv preprint arXiv:2310.10402, 2023 - arxiv.org

Synthetic training data has gained prominence in numerous learning tasks and scenarios,
offering advantages such as dataset augmentation, generalization evaluation, and privacy …

Cited by 21 · Related articles · All 3 versions
