Context-aware layout to image generation with enhanced object appearance

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer

The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

Cited by 99 · All 6 versions

LayoutLLM-T2I: Eliciting layout guidance from LLM for text-to-image generation

L Qu, S Wu, H Fei, L Nie, TS Chua - Proceedings of the 31st ACM …, 2023 - dl.acm.org

In the text-to-image generation field, recent remarkable progress in Stable Diffusion makes it
possible to generate rich kinds of novel photorealistic images. However, current models still …

Cited by 94 · All 3 versions

Spatial-temporal transformer for dynamic scene graph generation

Y Cong, W Liao, H Ackermann… - Proceedings of the …, 2021 - openaccess.thecvf.com

Dynamic scene graph generation aims at generating a scene graph of the given video.
Compared to the task of scene graph generation from images, it is more challenging …

Cited by 153 · All 12 versions

Frido: Feature pyramid diffusion for complex scene image synthesis

WC Fan, YC Chen, DD Chen, Y Cheng… - Proceedings of the …, 2023 - ojs.aaai.org

Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …

Cited by 80 · All 4 versions

Text to image generation with semantic-spatial aware GAN

W Liao, K Hu, MY Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Text-to-image synthesis (T2I) aims to generate photo-realistic images which are
semantically consistent with the text descriptions. Existing methods are usually built upon …

Cited by 177 · All 9 versions

SceneComposer: Any-level semantic image synthesis

Y Zeng, Z Lin, J Zhang, Q Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

We propose a new framework for conditional image synthesis from semantic layouts of any
precision levels, ranging from pure text to a 2D semantic canvas with precise shapes. More …

Cited by 42 · All 6 versions

DrivingDiffusion: Layout-Guided Multi-view Driving Scenarios Video Generation with Latent Diffusion Model

X Li, Y Zhang, X Ye - European Conference on Computer Vision, 2024 - Springer

With the surge in autonomous driving technologies, the reliance on comprehensive and high-
definition bird's-eye-view (BEV) representations has become paramount. This burgeoning …

Cited by 29 · All 2 versions

LayoutDiffuse: Adapting foundational diffusion models for layout-to-image generation

J Cheng, X Liang, X Shi, T He, T Xiao, M Li - arXiv preprint arXiv …, 2023 - arxiv.org

Layout-to-image generation refers to the task of synthesizing photo-realistic images based
on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational …

Cited by 61 · All 2 versions

Modeling image composition for complex scene generation

Z Yang, D Liu, C Wang, J Yang… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

We present a method that achieves state-of-the-art results on challenging (few-shot) layout-
to-image generation tasks by accurately modeling textures, structures and relationships …

Cited by 53 · All 6 versions

Diffusion-based scene graph to image generation with masked contrastive pre-training

L Yang, Z Huang, Y Song, S Hong, G Li… - arXiv preprint arXiv …, 2022 - arxiv.org

Generating images from graph-structured inputs, such as scene graphs, is uniquely
challenging due to the difficulty of aligning nodes and connections in graphs with objects …

Cited by 44 · All 4 versions
