Explore the power of synthetic data on few-shot object detection

S Alemohammad, J Casco-Rodriguez, L Luzi… - arXiv preprint arXiv …, 2023 - arxiv.org

Seismic advances in generative AI algorithms for imagery, text, and other data types has led
to the temptation to use synthetic data to train next-generation models. Repeating this …

被引用次数：142 相关文章所有 4 个版本

[PDF] openreview.net

Fine-tuning multimodal llms to follow zero-shot demonstrative instructions

J Li, K Pan, Z Ge, M Gao, W Ji, W Zhang… - The Twelfth …, 2023 - openreview.net

Recent advancements in Multimodal Large Language Models (MLLMs) have been utilizing
Visual Prompt Generators (VPGs) to convert visual features into tokens that LLMs can …

被引用次数：67 相关文章所有 2 个版本

A comprehensive survey for generative data augmentation

Y Chen, Z Yan, Y Zhu - Neurocomputing, 2024 - Elsevier

Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

A unified framework for generative data augmentation: A comprehensive survey

Y Chen, Z Yan, Y Zhu - arXiv preprint arXiv:2310.00277, 2023 - arxiv.org

Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

A tale of tails: Model collapse as a change of scaling laws

E Dohmatob, Y Feng, P Yang, F Charton… - arXiv preprint arXiv …, 2024 - arxiv.org

As AI model size grows, neural scaling laws have become a crucial tool to predict the
improvements of large models when increasing capacity and the size of original (human or …

被引用次数：40 相关文章所有 3 个版本

[PDF] arxiv.org

Ai-generated images as data source: The dawn of synthetic era

Z Yang, F Zhan, K Liu, M Xu, S Lu - arXiv preprint arXiv:2310.01830, 2023 - arxiv.org

The advancement of visual intelligence is intrinsically tethered to the availability of data. In
parallel, generative Artificial Intelligence (AI) has unlocked the potential to create synthetic …

被引用次数：20 相关文章所有 4 个版本

[PDF] ssrn.com

VLM-guided Explicit-Implicit Complementary novel class semantic learning for few-shot object detection

T Zhao, H Qiu, Y Dai, L Wang, H Mei, F Meng… - Expert Systems with …, 2024 - Elsevier

Few-shot object detection (FSOD) aims at learning a novel class object detector with
abundant base class samples and a limited number of novel class samples. Some recent …

被引用次数：3 相关文章

[PDF] arxiv.org

Interactive data synthesis for systematic vision adaptation via llms-aigcs collaboration

Q Yu, J Li, W Ye, S Tang, Y Zhuang - arXiv preprint arXiv:2305.12799, 2023 - arxiv.org

Recent text-to-image generation models have shown promising results in generating high-
fidelity photo-realistic images. In parallel, the problem of data scarcity has brought a growing …

被引用次数：14 相关文章所有 2 个版本

[PDF] arxiv.org

Datadream: Few-shot guided dataset generation

JM Kim, J Bader, S Alaniz, C Schmid… - European Conference on …, 2025 - Springer

While text-to-image diffusion models have been shown to achieve state-of-the-art results in
image synthesis, they have yet to prove their effectiveness in downstream applications …

被引用次数：1 相关文章所有 5 个版本

[PDF] arxiv.org

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

Z Yu, C Zhu, S Culatana, R Krishnamoorthi… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent advances in generative deep learning have enabled the creation of high-quality
synthetic images in text-to-image generation. Prior work shows that fine-tuning a pretrained …

被引用次数：6 相关文章所有 2 个版本

高级搜索

QQ 群