PIXART- : Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

J Chen, C Ge, E Xie, Y Wu, L Yao, X Ren… - … on Computer Vision, 2025 - Springer
In this paper, we introduce PixArt-\(\Sigma\), a Diffusion Transformer model (DiT) capable of
directly generating images at 4K resolution. PixArt-\(\Sigma\) represents a significant …

Fast high-resolution image synthesis with latent adversarial diffusion distillation

A Sauer, F Boesel, T Dockhorn, A Blattmann… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
Diffusion models are the main driver of progress in image and video synthesis, but suffer
from slow inference speed. Distillation methods, like the recently introduced adversarial …

Distilling diffusion models into conditional gans

M Kang, R Zhang, C Barnes, S Paris, S Kwak… - … on Computer Vision, 2025 - Springer
We propose a method to distill a complex multistep diffusion model into a single-step
conditional GAN student model, dramatically accelerating inference, while preserving image …

Diffusion model-based video editing: A survey

W Sun, RC Tu, J Liao, D Tao - arXiv preprint arXiv:2407.07111, 2024 - arxiv.org
The rapid development of diffusion models (DMs) has significantly advanced image and
video applications, making" what you want is what you see" a reality. Among these, video …

Revisiting non-autoregressive transformers for efficient image synthesis

Z Ni, Y Wang, R Zhou, J Guo, J Hu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The field of image synthesis is currently flourishing due to the advancements in diffusion
models. While diffusion models have been successful their computational intensity has …

Efficient diffusion models: A comprehensive survey from principles to practices

Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
As one of the most popular and sought-after generative models in the recent years, diffusion
models have sparked the interests of many researchers and steadily shown excellent …

Lazy diffusion transformer for interactive image editing

Y Nitzan, Z Wu, R Zhang, E Shechtman… - … on Computer Vision, 2025 - Springer
We introduce a novel diffusion transformer, LazyDiffusion, that generates partial image
updates efficiently. Our approach targets interactive image editing applications in which …

UIEDP: Boosting underwater image enhancement with diffusion prior

D Du, E Li, L Si, W Zhai, F Xu, J Niu, F Sun - Expert Systems with …, 2025 - Elsevier
Underwater image enhancement (UIE) aims to generate clear images from low-quality
underwater images. Due to the unavailability of clear reference images, researchers often …

Deep compression autoencoder for efficient high-resolution diffusion models

J Chen, H Cai, J Chen, E Xie, S Yang, H Tang… - arXiv preprint arXiv …, 2024 - arxiv.org
We present Deep Compression Autoencoder (DC-AE), a new family of autoencoder models
for accelerating high-resolution diffusion models. Existing autoencoder models have …

Turboedit: Instant text-based image editing

Z Wu, N Kolkin, J Brandt, R Zhang… - European Conference on …, 2025 - Springer
We address the challenges of precise image inversion and disentangled image editing in
the context of few-step diffusion models. We introduce an encoder based iterative inversion …