Wouaf: Weight modulation for user attribution and fingerprinting in text-to-image diffusion models

C Kim, K Min, M Patel, S Cheng… - Proceedings of the …, 2024 - openaccess.thecvf.com
The rapid advancement of generative models facilitating the creation of hyper-realistic
images from textual descriptions has concurrently escalated critical societal concerns such …

Divide and conquer: Language models can plan and self-correct for compositional text-to-image generation

Z Wang, E Xie, A Li, Z Wang, X Liu, Z Li - arXiv preprint arXiv:2401.15688, 2024 - arxiv.org
Despite significant advancements in text-to-image models for generating high-quality
images, these methods still struggle to ensure the controllability of text prompts over images …

Conceptbed: Evaluating concept learning abilities of text-to-image diffusion models

M Patel, T Gokhale, C Baral, Y Yang - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The ability to understand visual concepts and replicate and compose these concepts from
images is a central goal for computer vision. Recent advances in text-to-image (T2I) models …

-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

M Patel, S Jung, C Baral, Y Yang - arXiv preprint arXiv:2402.05195, 2024 - arxiv.org
Despite the recent advances in personalized text-to-image (P-T2I) generative models,
subject-driven T2I remains challenging. The primary bottlenecks include 1) Intensive training …

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

K Sun, K Huang, X Liu, Y Wu, Z Xu, Z Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-video (T2V) generation models have advanced significantly, yet their ability to
compose different objects, attributes, actions, and motions into a video remains unexplored …

Strengthening Image Generative AI: Integrating Fingerprinting and Revision Methods for Enhanced Safety and Control

C Kim - 2024 - keep.lib.asu.edu
In the rapidly evolving field of Generative Artificial Intelligence (Gen-AI) for imaging, models
such as DALL· E3 and Stable Diffusion have transitioned from theoretical concepts to …