Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in the real world, effective interaction and fusion
among multimodal information play a key role in the creation and perception of multimodal …

Image generation: A review

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer
The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

Image-to-image translation: Methods and applications

Y Pang, J Lin, T Qin, Z Chen - IEEE Transactions on Multimedia, 2021 - ieeexplore.ieee.org
Image-to-image translation (I2I) aims to transfer images from a source domain to a target
domain while preserving the content representations. I2I has drawn increasing attention and …

Unbalanced feature transport for exemplar-based image translation

F Zhan, Y Yu, K Cui, G Zhang, S Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite the great success of GANs in image translation with different conditioned inputs
such as semantic segmentation and edge map, generating high-fidelity images with …

Marginal contrastive correspondence for guided image generation

F Zhan, Y Yu, R Wu, J Zhang, S Lu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Exemplar-based image translation establishes dense correspondences between a
conditional input and an exemplar (from two different domains) for leveraging detailed …

XingGAN for person image generation

H Tang, S Bai, L Zhang, PHS Torr, N Sebe - Computer Vision–ECCV 2020 …, 2020 - Springer
We propose a novel Generative Adversarial Network (XingGAN or CrossingGAN) for person
image generation tasks, i.e., translating the pose of a given person to a desired one. The …

Auto-regressive image synthesis with integrated quantization

F Zhan, Y Yu, R Wu, J Zhang, K Cui, C Zhang… - European Conference on …, 2022 - Springer
Deep generative models have achieved conspicuous progress in realistic image synthesis
with multifarious conditional inputs, while generating diverse yet high-fidelity images …

AttentionGAN: Unpaired image-to-image translation using attention-guided generative adversarial networks

H Tang, H Liu, D Xu, PHS Torr… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
State-of-the-art methods in image-to-image translation are capable of learning a
mapping from a source domain to a target domain with unpaired image data. Though the …

Deep image spatial transformation for person image generation

Y Ren, X Yu, J Chen, TH Li, G Li - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Pose-guided person image generation is to transform a source person image to a target
pose. This task requires spatial manipulations of source data. However, Convolutional …

Towards unified text-based person retrieval: A large-scale multi-attribute and language search benchmark

S Yang, Y Zhou, Z Zheng, Y Wang, L Zhu… - Proceedings of the 31st …, 2023 - dl.acm.org
In this paper, we introduce a large Multi-Attribute and Language Search dataset for
text-based person retrieval, called MALS, and explore the feasibility of performing pre-training on …