Discovering interpretable directions in the semantic latent space of diffusion models

W Xia, JH Xue - ACM Computing Surveys, 2023 - dl.acm.org

Recent years have seen remarkable progress in deep learning powered visual content
creation. This includes deep generative 3D-aware image synthesis, which produces high …

被引用次数：27 相关文章所有 5 个版本

[PDF] arxiv.org

Concept sliders: Lora adaptors for precise control in diffusion models

R Gandikota, J Materzyńska, T Zhou, A Torralba… - … on Computer Vision, 2025 - Springer

We present a method to create interpretable concept sliders that enable precise control over
attributes in image generations from diffusion models. Our approach identifies a low-rank …

被引用次数：34 相关文章所有 2 个版本

A neural space-time representation for text-to-image personalization

Y Alaluf, E Richardson, G Metzer… - ACM Transactions on …, 2023 - dl.acm.org

A key aspect of text-to-image personalization methods is the manner in which the target
concept is represented within the generative process. This choice greatly affects the visual …

被引用次数：69 相关文章所有 3 个版本

[PDF] neurips.cc

Understanding the latent space of diffusion models through the lens of riemannian geometry

YH Park, M Kwon, J Choi, J Jo… - Advances in Neural …, 2023 - proceedings.neurips.cc

Despite the success of diffusion models (DMs), we still lack a thorough understanding of
their latent space. To understand the latent space $\mathbf {x} _t\in\mathcal {X} $, we …

被引用次数：45 相关文章所有 6 个版本

[PDF] thecvf.com

Self-discovering interpretable diffusion latent directions for responsible text-to-image generation

H Li, C Shen, P Torr, V Tresp… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Diffusion-based models have gained significant popularity for text-to-image generation due
to their exceptional image-generation capabilities. A risk with these models is the potential …

被引用次数：24 相关文章所有 4 个版本

[HTML] acm.org

Loosecontrol: Lifting controlnet for generalized depth conditioning

SF Bhat, N Mitra, P Wonka - ACM SIGGRAPH 2024 Conference Papers, 2024 - dl.acm.org

We present LooseControl to allow generalized depth conditioning for diffusion-based image
generation. ControlNet, the SOTA for depth conditioned image generation, produces …

被引用次数：18 相关文章所有 3 个版本

[PDF] neurips.cc

Concept algebra for (score-based) text-controlled generative models

Z Wang, L Gui, J Negrea… - Advances in Neural …, 2024 - proceedings.neurips.cc

This paper concerns the structure of learned representations in text-guided generative
models, focusing on score-based models. A key property of such models is that they can …

被引用次数：22 相关文章所有 6 个版本

[PDF] thecvf.com

Noiseclr: A contrastive learning approach for unsupervised discovery of interpretable directions in diffusion models

Y Dalva, P Yanardag - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Generative models have been very popular in the recent years for their image generation
capabilities. GAN-based models are highly regarded for their disentangled latent space …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Exploring low-dimensional subspaces in diffusion models for controllable image editing

S Chen, H Zhang, M Guo, Y Lu, P Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Recently, diffusion models have emerged as a powerful class of generative models. Despite
their success, there is still limited understanding of their semantic spaces. This makes it …

被引用次数：8 相关文章所有 3 个版本

[PDF] arxiv.org

Emergence of hidden capabilities: Exploring learning dynamics in concept space

CF Park, M Okawa, A Lee, ES Lubana… - arXiv preprint arXiv …, 2024 - arxiv.org

Modern generative models demonstrate impressive capabilities, likely stemming from an
ability to identify and manipulate abstract concepts underlying their training data. However …

被引用次数：4 相关文章所有 3 个版本

高级搜索

QQ 群