NUWA-LIP: language-guided image inpainting with defect-free VQGAN

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Cited by 263 - Related articles - All 11 versions

[PDF] Controlling vision-language models for universal image restoration

Z Luo, FK Gustafsson, Z Zhao, J Sjölund… - arXiv preprint arXiv …, 2023 - researchgate.net

Vision-language models such as CLIP have shown great impact on diverse downstream
tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such …

Cited by 75 - Related articles - All 2 versions

Controlling vision-language models for multi-task image restoration

Z Luo, FK Gustafsson, Z Zhao, J Sjölund… - International …, 2024 - diva-portal.org

Vision-language models such as CLIP have shown great impact on diverse downstream
tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such …

Cited by 12 - Related articles

Responsible visual editing

M Ni, Y Shen, L Zhang, W Zuo - European Conference on Computer …, 2025 - Springer

With the recent advancements in visual synthesis, there is a growing risk of encountering
synthesized images with detrimental effects, such as hate, discrimination, and privacy …

Cited by 3 - Related articles - All 2 versions

Deep learning-based image and video inpainting: A survey

W Quan, J Chen, Y Liu, DM Yan, P Wonka - International Journal of …, 2024 - Springer

Image and video inpainting is a classic problem in computer vision and computer graphics,
aiming to fill in the plausible and realistic content in the missing areas of images and videos …

Cited by 30 - Related articles - All 6 versions

Vision + language applications: A survey

Y Zhou, N Shimada - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com

Text-to-image generation has attracted significant interest from researchers and practitioners
in recent years due to its widespread and diverse applications across various industries …

Cited by 10 - Related articles - All 6 versions

Leveraging vision-language prompts for real-world image restoration and enhancement

Y Wei, Y Zhang, K Li, F Wang, S Tang… - Computer Vision and …, 2025 - Elsevier

Significant advancements have been made in image restoration methods aimed at removing
adverse weather effects. However, due to natural constraints, it is challenging to collect real …

Cited by 2 - Related articles

Architext: Language-driven generative architecture design

T Galanos, A Liapis, GN Yannakakis - arXiv preprint arXiv:2303.07519, 2023 - arxiv.org

Architectural design is a highly complex practice that involves a wide diversity of disciplines,
technologies, proprietary design software, expertise, and an almost infinite number of …

Cited by 10 - Related articles - All 2 versions

Sair: Learning semantic-aware implicit representation

C Zhang, X Li, Q Guo, S Wang - European Conference on Computer …, 2025 - Springer

Implicit representation of an image can map arbitrary coordinates in the continuous domain
to their corresponding color values, presenting a powerful capability for image …

Cited by 3 - Related articles - All 3 versions

Image inpainting by bidirectional information flow on texture and structure

J Lian, J Zhang, H Zhang, Y Chen, J Zhang, J Liu - Signal Processing, 2025 - Elsevier

Image inpainting aims to recover damaged regions of a corrupted image and maintain the
integrity of the structure and texture within the filled regions. Previous popular approaches …

Cited by 1 - Related articles - All 4 versions
