Freedom: Training-free energy-guided conditional diffusion model

J Yu, Y Wang, C Zhao, B Ghanem… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, conditional diffusion models have gained popularity in numerous applications due
to their exceptional generation ability. However, many existing methods are training …

Learning visual prior via generative pre-training

J Xie, K Ye, Y Li, Y Li, KQ Lin, Y Zheng… - Advances in …, 2024 - proceedings.neurips.cc
Various stuff and things in visual data possess specific traits, which can be learned by deep
neural networks and are implicitly represented as the visual prior, eg, object location and …

Dynamically masked discriminator for GANs

W Zhang, H Liu, B Li, J Xie, Y Huang… - Advances in …, 2024 - proceedings.neurips.cc
Abstract Training Generative Adversarial Networks (GANs) remains a challenging problem.
The discriminator trains the generator by learning the distribution of real/generated data …

FNContra: Frequency-domain Negative Sample Mining in Contrastive Learning for limited-data image generation

Q Yang, Z Zhao, Y Pu, S Pan, J Gu, D Xu - Expert Systems with Applications, 2025 - Elsevier
Substantial training data is necessary to train an effective generative adversarial network
(GANs), without which the discriminator is easily overfitting, causing the sub-optimal models …

CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction

S Sun, Z Luan, Z Zhao, S Luo, S Han - European Conference on Computer …, 2025 - Springer
Abstract Generative Adversarial Networks (GANs) have received considerable attention due
to its outstanding ability to generate images. However, training a GAN is hard since the …

Artistry in Pixels: FVS-A Framework for Evaluating Visual Elegance and Sentiment Resonance in Generated Images

W Li, L Xiao, X Wu, T Ma, J Zhao… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
The field of image generation models has seen substantial progress, characterized by a
proliferation of diverse generative models and their associated outputs. However, there …

Dynamic Token Augmentation Mamba for Cross-Scene Classification of Hyperspectral Image

X Huang, Y Zhang, F Luo… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Cross-scene classification of hyperspectral image (HSI) based on single-source domain
generalization (SDG) focuses on developing a model that can effectively classify images …

Improving the Training of the GANs with Limited Data via Dual Adaptive Noise Injection

Z Zhang, Y Hua, G Sun, H Wang… - Proceedings of the 32nd …, 2024 - dl.acm.org
Recently, many studies have highlighted that training Generative Adversarial Networks
(GANs) with limited data suffers from the overfitting of the discriminator (D). Existing studies …

StableTalk: Advancing Audio-to-Talking Face Generation with Stable Diffusion and Vision Transformer

F Nazarieh, J Kittler, MA Rana, D Kanojia… - … Conference on Pattern …, 2025 - Springer
Audio-to-talking face generation stands at the forefront of advancements in generative AI. It
bridges the gap between audio and visual representations by generating synchronized and …

Manifold Constraint Regularization for Remote Sensing Image Generation

X Su, C Zheng, W Qiang, F Wu, J Zhao… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Generative adversarial networks (GANs) have shown notable accomplishments in remote
sensing (RS) domain. However, this article reveals that their performance on RS images …