Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Reco: Region-controlled text-to-image generation

Z Yang, J Wang, Z Gan, L Li, K Lin… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, large-scale text-to-image (T2I) models have shown impressive performance in
generating high-fidelity images, but with limited controllability, eg, precisely specifying the …

Unbalanced feature transport for exemplar-based image translation

F Zhan, Y Yu, K Cui, G Zhang, S Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite the great success of GANs in images translation with different conditioned inputs
such as semantic segmentation and edge map, generating high-fidelity images with …

Auto-regressive image synthesis with integrated quantization

F Zhan, Y Yu, R Wu, J Zhang, K Cui, C Zhang… - European Conference on …, 2022 - Springer
Deep generative models have achieved conspicuous progress in realistic image synthesis
with multifarious conditional inputs, while generating diverse yet high-fidelity images …

Scenecomposer: Any-level semantic image synthesis

Y Zeng, Z Lin, J Zhang, Q Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a new framework for conditional image synthesis from semantic layouts of any
precision levels, ranging from pure text to a 2D semantic canvas with precise shapes. More …

Modeling image composition for complex scene generation

Z Yang, D Liu, C Wang, J Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We present a method that achieves state-of-the-art results on challenging (few-shot) layout-
to-image generation tasks by accurately modeling textures, structures and relationships …

Image synthesis from layout with locality-aware mask adaption

Z Li, J Wu, I Koh, Y Tang, L Sun - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
This paper is concerned with synthesizing images conditioned on a layout (a set of
bounding boxes with object categories). Existing works construct a layout-mask-image …

FloorplanGAN: Vector residential floorplan adversarial generation

Z Luo, W Huang - Automation in Construction, 2022 - Elsevier
An architectural floorplan is a class of drawings that reflects the layout of rooms. The
difference between a floorplan and a natural image and its dual features as both a vector …

Conditional 360-degree image synthesis for immersive indoor scene decoration

KC Shum, HW Pang, BS Hua… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we address the problem of conditional scene decoration for 360deg images.
Our method takes a 360deg background photograph of an indoor scene and generates …

Bi-level feature alignment for versatile image translation and manipulation

F Zhan, Y Yu, R Wu, J Zhang, K Cui, A Xiao… - … on Computer Vision, 2022 - Springer
Generative adversarial networks (GANs) have achieved great success in image translation
and manipulation. However, high-fidelity image generation with faithful style control remains …