Self-consuming generative models go mad

S Alemohammad, J Casco-Rodriguez, L Luzi… - arXiv preprint arXiv …, 2023 - arxiv.org
Seismic advances in generative AI algorithms for imagery, text, and other data types has led
to the temptation to use synthetic data to train next-generation models. Repeating this …

Fine-tuning multimodal llms to follow zero-shot demonstrative instructions

J Li, K Pan, Z Ge, M Gao, W Ji, W Zhang… - The Twelfth …, 2023 - openreview.net
Recent advancements in Multimodal Large Language Models (MLLMs) have been utilizing
Visual Prompt Generators (VPGs) to convert visual features into tokens that LLMs can …

A comprehensive survey for generative data augmentation

Y Chen, Z Yan, Y Zhu - Neurocomputing, 2024 - Elsevier
Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

A unified framework for generative data augmentation: A comprehensive survey

Y Chen, Z Yan, Y Zhu - arXiv preprint arXiv:2310.00277, 2023 - arxiv.org
Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

A tale of tails: Model collapse as a change of scaling laws

E Dohmatob, Y Feng, P Yang, F Charton… - arXiv preprint arXiv …, 2024 - arxiv.org
As AI model size grows, neural scaling laws have become a crucial tool to predict the
improvements of large models when increasing capacity and the size of original (human or …

Ai-generated images as data source: The dawn of synthetic era

Z Yang, F Zhan, K Liu, M Xu, S Lu - arXiv preprint arXiv:2310.01830, 2023 - arxiv.org
The advancement of visual intelligence is intrinsically tethered to the availability of data. In
parallel, generative Artificial Intelligence (AI) has unlocked the potential to create synthetic …

VLM-guided Explicit-Implicit Complementary novel class semantic learning for few-shot object detection

T Zhao, H Qiu, Y Dai, L Wang, H Mei, F Meng… - Expert Systems with …, 2024 - Elsevier
Few-shot object detection (FSOD) aims at learning a novel class object detector with
abundant base class samples and a limited number of novel class samples. Some recent …

Interactive data synthesis for systematic vision adaptation via llms-aigcs collaboration

Q Yu, J Li, W Ye, S Tang, Y Zhuang - arXiv preprint arXiv:2305.12799, 2023 - arxiv.org
Recent text-to-image generation models have shown promising results in generating high-
fidelity photo-realistic images. In parallel, the problem of data scarcity has brought a growing …

Datadream: Few-shot guided dataset generation

JM Kim, J Bader, S Alaniz, C Schmid… - European Conference on …, 2025 - Springer
While text-to-image diffusion models have been shown to achieve state-of-the-art results in
image synthesis, they have yet to prove their effectiveness in downstream applications …

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

Z Yu, C Zhu, S Culatana, R Krishnamoorthi… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in generative deep learning have enabled the creation of high-quality
synthetic images in text-to-image generation. Prior work shows that fine-tuning a pretrained …