Bootpig: Bootstrapping zero-shot personalized image generation capabilities in pretrained...

Y Wei, Z Ji, J Bai, H Zhang, L Zhang, W Zuo - European Conference on …, 2025 - Springer

Abstract Text-to-image (T2I) diffusion models have shown significant success in
personalized text-to-image generation, which aims to generate novel images with human …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

X Chen, Z Zhang, H Zhang, Y Zhou, SY Kim… - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce UniReal, a unified framework designed to address various image generation
and editing tasks. Existing solutions often vary by tasks, yet share fundamental principles …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

MagicFace: Training-free Universal-Style Human Image Customized Synthesis

Y Wang, W Zhang, C Jin - arXiv preprint arXiv:2408.07433, 2024 - arxiv.org

Current human image customization methods leverage Stable Diffusion (SD) for its rich
semantic prior. However, since SD is not specifically designed for human-oriented …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

A Survey on Personalized Content Synthesis with Diffusion Models

X Zhang, XY Wei, W Zhang, J Wu, Z Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent advancements in generative models have significantly impacted content creation,
leading to the emergence of Personalized Content Synthesis (PCS). With a small set of user …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts

Y Han, R Wang, C Zhang, J Hu, P Cheng, B Fu… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent advancements in image generation have enabled the creation of high-quality
images from text conditions. However, when facing multi-modal conditions, such as text …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

高级搜索

QQ 群