Image generation: A review

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer
The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

LayoutLLM-T2I: Eliciting layout guidance from LLM for text-to-image generation

L Qu, S Wu, H Fei, L Nie, TS Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org
In the text-to-image generation field, recent remarkable progress in Stable Diffusion makes it
possible to generate rich kinds of novel photorealistic images. However, current models still …

Spatial-temporal transformer for dynamic scene graph generation

Y Cong, W Liao, H Ackermann… - Proceedings of the …, 2021 - openaccess.thecvf.com
Dynamic scene graph generation aims at generating a scene graph of the given video.
Compared to the task of scene graph generation from images, it is more challenging …

Frido: Feature pyramid diffusion for complex scene image synthesis

WC Fan, YC Chen, DD Chen, Y Cheng… - Proceedings of the …, 2023 - ojs.aaai.org
Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …

Text to image generation with semantic-spatial aware GAN

W Liao, K Hu, MY Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Text-to-image synthesis (T2I) aims to generate photo-realistic images which are
semantically consistent with the text descriptions. Existing methods are usually built upon …

SceneComposer: Any-level semantic image synthesis

Y Zeng, Z Lin, J Zhang, Q Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a new framework for conditional image synthesis from semantic layouts of any
precision levels, ranging from pure text to a 2D semantic canvas with precise shapes. More …

DrivingDiffusion: Layout-Guided Multi-view Driving Scenarios Video Generation with Latent Diffusion Model

X Li, Y Zhang, X Ye - European Conference on Computer Vision, 2024 - Springer
With the surge in autonomous driving technologies, the reliance on comprehensive and
high-definition bird's-eye-view (BEV) representations has become paramount. This burgeoning …

LayoutDiffuse: Adapting foundational diffusion models for layout-to-image generation

J Cheng, X Liang, X Shi, T He, T Xiao, M Li - arXiv preprint arXiv …, 2023 - arxiv.org
Layout-to-image generation refers to the task of synthesizing photo-realistic images based
on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational …

Modeling image composition for complex scene generation

Z Yang, D Liu, C Wang, J Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We present a method that achieves state-of-the-art results on challenging (few-shot)
layout-to-image generation tasks by accurately modeling textures, structures and relationships …

Diffusion-based scene graph to image generation with masked contrastive pre-training

L Yang, Z Huang, Y Song, S Hong, G Li… - arXiv preprint arXiv …, 2022 - arxiv.org
Generating images from graph-structured inputs, such as scene graphs, is uniquely
challenging due to the difficulty of aligning nodes and connections in graphs with objects …