Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

[PDF][PDF] Controlling vision-language models for universal image restoration

Z Luo, FK Gustafsson, Z Zhao, J Sjölund… - arXiv preprint arXiv …, 2023 - researchgate.net
Vision-language models such as CLIP have shown great impact on diverse downstream
tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such …

Controlling vision-language models for multi-task image restoration

Z Luo, FK Gustafsson, Z Zhao, J Sjölund… - International …, 2024 - diva-portal.org
Vision-language models such as CLIP have shown great impact on diverse downstream
tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such …

Responsible visual editing

M Ni, Y Shen, L Zhang, W Zuo - European Conference on Computer …, 2025 - Springer
With the recent advancements in visual synthesis, there is a growing risk of encountering
synthesized images with detrimental effects, such as hate, discrimination, and privacy …

Deep learning-based image and video inpainting: A survey

W Quan, J Chen, Y Liu, DM Yan, P Wonka - International Journal of …, 2024 - Springer
Image and video inpainting is a classic problem in computer vision and computer graphics,
aiming to fill in the plausible and realistic content in the missing areas of images and videos …

Vision+ language applications: A survey

Y Zhou, N Shimada - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Text-to-image generation has attracted significant interest from researchers and practitioners
in recent years due to its widespread and diverse applications across various industries …

Leveraging vision-language prompts for real-world image restoration and enhancement

Y Wei, Y Zhang, K Li, F Wang, S Tang… - Computer Vision and …, 2025 - Elsevier
Significant advancements have been made in image restoration methods aimed at removing
adverse weather effects. However, due to natural constraints, it is challenging to collect real …

Architext: Language-driven generative architecture design

T Galanos, A Liapis, GN Yannakakis - arXiv preprint arXiv:2303.07519, 2023 - arxiv.org
Architectural design is a highly complex practice that involves a wide diversity of disciplines,
technologies, proprietary design software, expertise, and an almost infinite number of …

Sair: Learning semantic-aware implicit representation

C Zhang, X Li, Q Guo, S Wang - European Conference on Computer …, 2025 - Springer
Implicit representation of an image can map arbitrary coordinates in the continuous domain
to their corresponding color values, presenting a powerful capability for image …

Image inpainting by bidirectional information flow on texture and structure

J Lian, J Zhang, H Zhang, Y Chen, J Zhang, J Liu - Signal Processing, 2025 - Elsevier
Image inpainting aims to recover damaged regions of a corrupted image and maintain the
integrity of the structure and texture within the filled regions. Previous popular approaches …