Text-to-image diffusion models in generative ai: A survey

C Zhang, C Zhang, M Zhang, IS Kweon - arXiv preprint arXiv:2303.07909, 2023 - arxiv.org
This survey reviews text-to-image diffusion models in the context that diffusion models have
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …

Easily accessible text-to-image generation amplifies demographic stereotypes at large scale

F Bianchi, P Kalluri, E Durmus, F Ladhak… - Proceedings of the …, 2023 - dl.acm.org
Machine learning models that convert user-written text descriptions into images are now
widely available online and used by millions of users to generate millions of images a day …

When and why vision-language models behave like bags-of-words, and what to do about it?

M Yuksekgonul, F Bianchi, P Kalluri… - The Eleventh …, 2023 - openreview.net
Despite the success of large vision and language models (VLMs) in many downstream
applications, it is unclear how well they encode the compositional relationships between …

Stable bias: Evaluating societal representations in diffusion models

S Luccioni, C Akiki, M Mitchell… - Advances in Neural …, 2024 - proceedings.neurips.cc
As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …

Stable bias: Analyzing societal representations in diffusion models

AS Luccioni, C Akiki, M Mitchell, Y Jernite - arXiv preprint arXiv …, 2023 - arxiv.org
As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …

Iti-gen: Inclusive text-to-image generation

C Zhang, X Chen, S Chai, CH Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …

Dall-eval: Probing the reasoning skills and social biases of text-to-image generation models

J Cho, A Zala, M Bansal - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recently, DALL-E, a multimodal transformer language model, and its variants including
diffusion models have shown high-quality text-to-image generation capabilities. However …

AI's regimes of representation: A community-centered study of text-to-image models in South Asia

R Qadri, R Shelby, CL Bennett, E Denton - Proceedings of the 2023 …, 2023 - dl.acm.org
This paper presents a community-centered study of cultural limitations of text-to-image (T2I)
models in the South Asian context. We theorize these failures using scholarship on …

Leaving reality to imagination: Robust classification via generated datasets

H Bansal, A Grover - arXiv preprint arXiv:2302.02503, 2023 - arxiv.org
Recent research on robustness has revealed significant performance gaps between neural
image classifiers trained on datasets that are similar to the test set, and those that are from a …

Fair diffusion: Instructing text-to-image generation models on fairness

F Friedrich, M Brack, L Struppek, D Hintersdorf… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative AI models have recently achieved astonishing results in quality and are
consequently employed in a fast-growing number of applications. However, since they are …