AnyDoor: Zero-shot object-level image customization

X Chen, L Huang, Y Liu, Y Shen… - Proceedings of the …, 2024 - openaccess.thecvf.com
This work presents AnyDoor, a diffusion-based image generator with the power to teleport
target objects to new scenes at user-specified locations with desired shapes. Instead of …

VideoComposer: Compositional video synthesis with motion controllability

X Wang, H Yuan, S Zhang, D Chen… - Advances in …, 2024 - proceedings.neurips.cc
The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …

Composer: Creative and controllable image synthesis with composable conditions

L Huang, D Chen, Y Liu, Y Shen, D Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent large-scale generative models learned on big data are capable of synthesizing
incredible images yet suffer from limited controllability. This work offers a new generation …

Emu Edit: Precise image editing via recognition and generation tasks

S Sheynin, A Polyak, U Singer… - Proceedings of the …, 2024 - openaccess.thecvf.com
Instruction-based image editing holds immense potential for a variety of applications as it
enables users to perform any editing operation using a natural language instruction …

A task is worth one word: Learning with task prompts for high-quality versatile image inpainting

J Zhuang, Y Zeng, W Liu, C Yuan, K Chen - European Conference on …, 2025 - Springer
Advancing image inpainting is challenging as it requires filling user-specified regions for
various intents, such as background filling and object synthesis. Existing approaches focus …

DMV3D: Denoising multi-view diffusion using 3D large reconstruction model

Y Xu, H Tan, F Luan, S Bi, P Wang, J Li, Z Shi… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose DMV3D, a novel 3D generation approach that uses a transformer-based
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …

ObjectStitch: Object compositing with diffusion model

Y Song, Z Zhang, Z Lin, S Cohen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Object compositing based on 2D images is a challenging problem since it typically involves
multiple processing stages such as color harmonization, geometry correction and shadow …

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Diffusion models for imperceptible and transferable adversarial attack

J Chen, H Chen, K Chen, Y Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Many existing adversarial attacks generate L_p-norm perturbations on image RGB space.
Despite some achievements in transferability and attack success rate, the crafted adversarial …

Towards language-driven video inpainting via multimodal large language models

J Wu, X Li, C Si, S Zhou, J Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce a new task, language-driven video inpainting, which uses natural language
instructions to guide the inpainting process. This approach overcomes the limitations of …