How well can text-to-image generative models understand ethical natural language interventions?

C Zhang, C Zhang, M Zhang, IS Kweon - arXiv preprint arXiv:2303.07909, 2023 - arxiv.org

This survey reviews text-to-image diffusion models in the context that diffusion models have
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …

被引用次数：214 相关文章所有 4 个版本

[PDF] arxiv.org

Easily accessible text-to-image generation amplifies demographic stereotypes at large scale

F Bianchi, P Kalluri, E Durmus, F Ladhak… - Proceedings of the …, 2023 - dl.acm.org

Machine learning models that convert user-written text descriptions into images are now
widely available online and used by millions of users to generate millions of images a day …

被引用次数：178 相关文章所有 4 个版本

[PDF] openreview.net

When and why vision-language models behave like bags-of-words, and what to do about it?

M Yuksekgonul, F Bianchi, P Kalluri… - The Eleventh …, 2023 - openreview.net

Despite the success of large vision and language models (VLMs) in many downstream
applications, it is unclear how well they encode the compositional relationships between …

被引用次数：202 相关文章所有 3 个版本

[PDF] neurips.cc

Stable bias: Evaluating societal representations in diffusion models

S Luccioni, C Akiki, M Mitchell… - Advances in Neural …, 2024 - proceedings.neurips.cc

As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …

被引用次数：40 相关文章所有 4 个版本

[PDF] arxiv.org

Stable bias: Analyzing societal representations in diffusion models

AS Luccioni, C Akiki, M Mitchell, Y Jernite - arXiv preprint arXiv …, 2023 - arxiv.org

As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …

被引用次数：107 相关文章

[PDF] thecvf.com

Iti-gen: Inclusive text-to-image generation

C Zhang, X Chen, S Chai, CH Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …

被引用次数：27 相关文章所有 5 个版本

[PDF] thecvf.com

Dall-eval: Probing the reasoning skills and social biases of text-to-image generation models

J Cho, A Zala, M Bansal - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Recently, DALL-E, a multimodal transformer language model, and its variants including
diffusion models have shown high-quality text-to-image generation capabilities. However …

被引用次数：82 相关文章所有 5 个版本

[PDF] acm.org

AI's regimes of representation: A community-centered study of text-to-image models in South Asia

R Qadri, R Shelby, CL Bennett, E Denton - Proceedings of the 2023 …, 2023 - dl.acm.org

This paper presents a community-centered study of cultural limitations of text-to-image (T2I)
models in the South Asian context. We theorize these failures using scholarship on …

被引用次数：30 相关文章所有 5 个版本

[PDF] arxiv.org

Leaving reality to imagination: Robust classification via generated datasets

H Bansal, A Grover - arXiv preprint arXiv:2302.02503, 2023 - arxiv.org

Recent research on robustness has revealed significant performance gaps between neural
image classifiers trained on datasets that are similar to the test set, and those that are from a …

被引用次数：56 相关文章所有 5 个版本

[PDF] arxiv.org

Fair diffusion: Instructing text-to-image generation models on fairness

F Friedrich, M Brack, L Struppek, D Hintersdorf… - arXiv preprint arXiv …, 2023 - arxiv.org

Generative AI models have recently achieved astonishing results in quality and are
consequently employed in a fast-growing number of applications. However, since they are …

被引用次数：62 相关文章所有 2 个版本

高级搜索

QQ 群