Text-driven video generation witnesses rapid progress. However merely using text prompts is not enough to depict the desired subject appearance that accurately aligns with users' …
Diffusion models gain increasing popularity for their generative capabilities. Recently, there have been surging needs to generate customized images by inverting diffusion models from …
J Jain, J Yang, H Shi - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Humans possess the remarkable skill of Visual Perception the ability to see and understand the seen helping them make sense of the visual world and in turn reason. Multimodal Large …
Recently diffusion models have made remarkable progress in text-to-image (T2I) generation synthesizing images with high fidelity and diverse contents. Despite this advancement latent …
DY Chen, AK Bhunia, S Koley, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper we democratise caricature generation empowering individuals to effortlessly craft personalised caricatures with just a photo and a conceptual sketch. Our objective is to …
Y Hu, Y Lin, W Wang, Y Zhao, Y Wei, H Shi - European Conference on …, 2025 - Springer
Existing natural image matting algorithms inevitably have flaws in their predictions on difficult cases, and their one-step prediction manner cannot further correct these errors. In …
The remarkable generative capabilities of diffusion models have motivated extensive research in both image and video editing. Compared to video editing which faces additional …
The remarkable efficacy of text-to-image diffusion models has motivated extensive exploration of their potential application in video domains. Zero-shot methods seek to extend …
Image and video synthesis has become a blooming topic in computer vision and machine learning communities along with the developments of deep generative models, due to its …