InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

P Cao, F Zhou, Q Song, L Yang - arXiv preprint arXiv:2403.04279, 2024 - arxiv.org

In the rapidly advancing realm of visual generation, diffusion models have revolutionized the
landscape, marking a significant shift in capabilities with their impressive text-guided …

被引用次数：25 相关文章所有 2 个版本

[PDF] openreview.net

Customizing text-to-image generation with inverted interaction

M Ge, X Jia, T Isobe, X Li, Q Wang, J Mu… - Proceedings of the …, 2024 - dl.acm.org

Subject-driven image generation, aimed at customizing user-specified subjects, has
experienced rapid progress. However, most of them focus on transferring the customized …

被引用次数：1 相关文章所有 3 个版本

Record: Reasoning and correcting diffusion for hoi generation

JY Jiang-Lin, KY Huang, L Lo, YN Huang… - Proceedings of the …, 2024 - dl.acm.org

Diffusion models revolutionize image generation by leveraging natural language to guide
the creation of multimedia content. Despite significant advancements in such generative …

被引用次数：3 相关文章所有 5 个版本

[PDF] arxiv.org

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation

H Zhang, D Hong, T Gao, Y Wang, J Shao… - arXiv preprint arXiv …, 2024 - arxiv.org

Diffusion models have been recognized for their ability to generate images that are not only
visually appealing but also of high artistic quality. As a result, Layout-to-Image (L2I) …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Parametric-ControlNet: Multimodal Control in Foundation Models for Precise Engineering Design Synthesis

R Zhou, Y Zhang, C Yuan, F Permenter… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper introduces a generative model designed for multimodal control over text-to-
image foundation generative AI models such as Stable Diffusion, specifically tailored for …

高级搜索

QQ 群