GaussCtrl: multi-view consistent text-driven 3D Gaussian splatting editing

J Wu, JW Bian, X Li, G Wang, I Reid, P Torr… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructed by 3D
Gaussian Splatting (3DGS). Our method first renders a collection of images by using the …

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

X Shuai, H Ding, X Ma, R Tu, YG Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
Image editing aims to edit the given synthetic or real image to meet the specific requirements
from users. It has been widely studied in recent years as a promising and challenging field of …

GeoDiffuser: Geometry-Based Image Editing with Diffusion Models

R Sajnani, J Vanbaar, J Min, K Katyal… - arXiv preprint arXiv …, 2024 - arxiv.org
The success of image generative models has enabled us to build methods that can edit
images based on text or other user input. However, these methods are bespoke, imprecise …

GenHeld: Generating and Editing Handheld Objects

C Min, S Sridhar - arXiv preprint arXiv:2406.05059, 2024 - arxiv.org
Grasping is an important human activity that has long been studied in robotics, computer
vision, and cognitive science. Most existing works study grasping from the perspective of …

RegionDrag: Fast Region-Based Image Editing with Diffusion Models

J Lu, X Li, K Han - arXiv preprint arXiv:2407.18247, 2024 - arxiv.org
Point-drag-based image editing methods, like DragDiffusion, have attracted significant
attention. However, point-drag-based approaches suffer from computational overhead and …

Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner

X Cui, P Li, Z Li, X Liu, Y Zou, Z He - arXiv preprint arXiv:2406.00432, 2024 - arxiv.org
Flexible and accurate drag-based editing is a challenging task that has recently garnered
significant attention. Current methods typically model this problem as automatically …

DragText: Rethinking Text Embedding in Point-based Image Editing

G Choi, T Jeong, S Hong, J Joo, SJ Hwang - arXiv preprint arXiv …, 2024 - arxiv.org
Point-based image editing enables accurate and flexible control through content dragging.
However, the role of text embedding in the editing process has not been thoroughly …

FastDrag: Manipulate Anything in One Step

X Zhao, J Guan, C Fan, D Xu, Y Lin, H Pan… - arXiv preprint arXiv …, 2024 - arxiv.org
Drag-based image editing using generative models provides precise control over image
contents, enabling users to manipulate anything in an image with a few clicks. However …