DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

J Wu, JW Bian, X Li, G Wang, I Reid, P Torr… - arXiv preprint arXiv …, 2024 - arxiv.org

We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructed by the 3D
Gaussian Splatting (3DGS). Our method first renders a collection of images by using the …

Cited by 5 · Related articles · All 4 versions

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

Cited by 5 · Related articles · All 2 versions

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

X Shuai, H Ding, X Ma, R Tu, YG Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org

Image editing aims to edit the given synthetic or real image to meet the specific requirements
from users. It is widely studied in recent years as a promising and challenging field of …

Cited by 2 · Related articles

GeoDiffuser: Geometry-Based Image Editing with Diffusion Models

R Sajnani, J Vanbaar, J Min, K Katyal… - arXiv preprint arXiv …, 2024 - arxiv.org

The success of image generative models has enabled us to build methods that can edit
images based on text or other user input. However, these methods are bespoke, imprecise …

Cited by 1 · Related articles · All 3 versions

GenHeld: Generating and Editing Handheld Objects

C Min, S Sridhar - arXiv preprint arXiv:2406.05059, 2024 - arxiv.org

Grasping is an important human activity that has long been studied in robotics, computer
vision, and cognitive science. Most existing works study grasping from the perspective of …

Related articles · All 2 versions

RegionDrag: Fast Region-Based Image Editing with Diffusion Models

J Lu, X Li, K Han - arXiv preprint arXiv:2407.18247, 2024 - arxiv.org

Point-drag-based image editing methods, like DragDiffusion, have attracted significant
attention. However, point-drag-based approaches suffer from computational overhead and …

Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner

X Cui, P Li, Z Li, X Liu, Y Zou, Z He - arXiv preprint arXiv:2406.00432, 2024 - arxiv.org

Flexible and accurate drag-based editing is a challenging task that has recently garnered
significant attention. Current methods typically model this problem as automatically …

Related articles · All 2 versions

DragText: Rethinking Text Embedding in Point-based Image Editing

G Choi, T Jeong, S Hong, J Joo, SJ Hwang - arXiv preprint arXiv …, 2024 - arxiv.org

Point-based image editing enables accurate and flexible control through content dragging.
However, the role of text embedding in the editing process has not been thoroughly …

Related articles · All 2 versions

FastDrag: Manipulate Anything in One Step

X Zhao, J Guan, C Fan, D Xu, Y Lin, H Pan… - arXiv preprint arXiv …, 2024 - arxiv.org

Drag-based image editing using generative models provides precise control over image
contents, enabling users to manipulate anything in an image with a few clicks. However …

Related articles · All 2 versions