Smartbrush: Text and shape guided object inpainting with diffusion model

X Chen, L Huang, Y Liu, Y Shen… - Proceedings of the …, 2024 - openaccess.thecvf.com

This work presents AnyDoor a diffusion-based image generator with the power to teleport
target objects to new scenes at user-specified locations with desired shapes. Instead of …

被引用次数：191 相关文章所有 3 个版本

Videocomposer: Compositional video synthesis with motion controllability

X Wang, H Yuan, S Zhang, D Chen… - Advances in …, 2024 - proceedings.neurips.cc

The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …

被引用次数：264 相关文章所有 6 个版本

[PDF] arxiv.org

Composer: Creative and controllable image synthesis with composable conditions

L Huang, D Chen, Y Liu, Y Shen, D Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent large-scale generative models learned on big data are capable of synthesizing
incredible images yet suffer from limited controllability. This work offers a new generation …

被引用次数：236 相关文章所有 5 个版本

[PDF] thecvf.com

Emu edit: Precise image editing via recognition and generation tasks

S Sheynin, A Polyak, U Singer… - Proceedings of the …, 2024 - openaccess.thecvf.com

Instruction-based image editing holds immense potential for a variety of applications as it
enables users to perform any editing operation using a natural language instruction …

被引用次数：77 相关文章所有 5 个版本

[PDF] arxiv.org

A task is worth one word: Learning with task prompts for high-quality versatile image inpainting

J Zhuang, Y Zeng, W Liu, C Yuan, K Chen - European Conference on …, 2025 - Springer

Advancing image inpainting is challenging as it requires filling user-specified regions for
various intents, such as background filling and object synthesis. Existing approaches focus …

被引用次数：37 相关文章所有 2 个版本

[PDF] arxiv.org

Dmv3d: Denoising multi-view diffusion using 3d large reconstruction model

Y Xu, H Tan, F Luan, S Bi, P Wang, J Li, Z Shi… - arXiv preprint arXiv …, 2023 - arxiv.org

We propose\textbf {DMV3D}, a novel 3D generation approach that uses a transformer-based
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …

被引用次数：115 相关文章所有 3 个版本

[PDF] thecvf.com

Objectstitch: Object compositing with diffusion model

Y Song, Z Zhang, Z Lin, S Cohen… - Proceedings of the …, 2023 - openaccess.thecvf.com

Object compositing based on 2D images is a challenging problem since it typically involves
multiple processing stages such as color harmonization, geometry correction and shadow …

被引用次数：71 相关文章所有 4 个版本

[PDF] arxiv.org

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

被引用次数：61 相关文章所有 2 个版本

[PDF] arxiv.org

Diffusion models for imperceptible and transferable adversarial attack

J Chen, H Chen, K Chen, Y Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Many existing adversarial attacks generate-norm perturbations on image RGB space.
Despite some achievements in transferability and attack success rate, the crafted adversarial …

被引用次数：47 相关文章所有 3 个版本

[PDF] thecvf.com

Towards language-driven video inpainting via multimodal large language models

J Wu, X Li, C Si, S Zhou, J Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce a new task--language-driven video inpainting which uses natural language
instructions to guide the inpainting process. This approach overcomes the limitations of …

被引用次数：19 相关文章所有 3 个版本

高级搜索

QQ 群