Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Gan inversion: A survey

W Xia, Y Zhang, Y Yang, JH Xue… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
GAN inversion aims to invert a given image back into the latent space of a pretrained GAN
model so that the image can be faithfully reconstructed from the inverted code by the …

Semantic image synthesis via diffusion models

W Wang, J Bao, W Zhou, D Chen, D Chen… - arXiv preprint arXiv …, 2022 - arxiv.org
Denoising Diffusion Probabilistic Models (DDPMs) have achieved remarkable success in
various image generation tasks compared with Generative Adversarial Nets (GANs). Recent …

Freestyle layout-to-image synthesis

H Xue, Z Huang, Q Sun, L Song… - Proceedings of the …, 2023 - openaccess.thecvf.com
Typical layout-to-image synthesis (LIS) models generate images for a closed set of semantic
classes, eg, 182 common objects in COCO-Stuff. In this work, we explore the freestyle …

Unbalanced feature transport for exemplar-based image translation

F Zhan, Y Yu, K Cui, G Zhang, S Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite the great success of GANs in images translation with different conditioned inputs
such as semantic segmentation and edge map, generating high-fidelity images with …

You only need adversarial supervision for semantic image synthesis

V Sushko, E Schönfeld, D Zhang, J Gall… - arXiv preprint arXiv …, 2020 - arxiv.org
Despite their recent successes, GAN models for semantic image synthesis still suffer from
poor image quality when trained with only adversarial supervision. Historically, additionally …

Marginal contrastive correspondence for guided image generation

F Zhan, Y Yu, R Wu, J Zhang, S Lu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Exemplar-based image translation establishes dense correspondences between a
conditional input and an exemplar (from two different domains) for leveraging detailed …

Auto-regressive image synthesis with integrated quantization

F Zhan, Y Yu, R Wu, J Zhang, K Cui, C Zhang… - European Conference on …, 2022 - Springer
Deep generative models have achieved conspicuous progress in realistic image synthesis
with multifarious conditional inputs, while generating diverse yet high-fidelity images …

Emlight: Lighting estimation via spherical distribution approximation

F Zhan, C Zhang, Y Yu, Y Chang, S Lu, F Ma… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Illumination estimation from a single image is critical in 3D rendering and it has been
investigated extensively in the computer vision and computer graphic research community …

Semanticstylegan: Learning compositional generative priors for controllable image synthesis and editing

Y Shi, X Yang, Y Wan, X Shen - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Recent studies have shown that StyleGANs provide promising prior models for downstream
tasks on image synthesis and editing. However, since the latent codes of StyleGANs are …