Sound-guided semantic image manipulation

SH Lee, W Roh, W Byeon, SH Yoon… - Proceedings of the …, 2022 - openaccess.thecvf.com
The recent success of the generative model shows that leveraging the multi-modal
embedding space can manipulate an image using text information. However, manipulating …

SpaceEdit: Learning a unified editing space for open-domain image color editing

J Shi, N Xu, H Zheng, A Smith… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, large pretrained models (e.g., BERT, StyleGAN, CLIP) show great knowledge
transfer and generalization capability on various downstream tasks within their domains …

ManiTrans: Entity-level text-guided image manipulation via token-wise semantic alignment and generation

J Wang, G Lu, H Xu, Z Li, C Xu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Existing text-guided image manipulation methods aim to modify the appearance of the
image or to edit a few objects in a virtual or simple scenario, which is far from practical …

Adversarial and isotropic gradient augmentation for image retrieval with text feedback

F Huang, L Zhang, Y Zhou… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Image Retrieval with Text Feedback (IRTF) is an emerging research topic where the query
consists of an image and a text expressing a requested attribute modification. The goal is to …

Disentangling Content and Motion for Text-Based Neural Video Manipulation

L Karacan, T Kerimoğlu, I Inan, T Birdal… - arXiv preprint arXiv …, 2022 - arxiv.org
Giving machines the ability to imagine possible new objects or scenes from linguistic
descriptions and produce their realistic renderings is arguably one of the most challenging …

Language Driven Image Editing via Transformers

R Santos, A Branco, J Silva - 2022 IEEE 34th International …, 2022 - ieeexplore.ieee.org
With the emergence of specifically tailored neural architectures that cope with both
modalities, cross-modal language and image processing has attracted increasing attention …

Cost-Effective Language Driven Image Editing with LX-DRIM

R Santos, A Branco, J Silva - Proceedings of the First Workshop …, 2022 - aclanthology.org
Cross-modal language and image processing is envisaged as a way to improve language
understanding by resorting to visual grounding, but only recently, with the emergence of …

Weakly Supervised Neuro-Symbolic Image Manipulation via Multi-Hop Complex Instructions

H Singh, P Garg, M Gupta, K Shah, AK Mondal… - openreview.net
We are interested in image manipulation via natural language text, a task that is extremely
useful for multiple AI applications but requires complex reasoning over multi-modal spaces …