Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

X Fan, A Bhattad, R Krishna - arXiv preprint arXiv:2403.14617, 2024 - arxiv.org
We introduce Videoshop, a training-free video editing algorithm for localized semantic edits.
Videoshop allows users to use any editing software, including Photoshop and generative …

Smart Vision-Language Reasoners

D Roberts, L Roberts - arXiv preprint arXiv:2407.04212, 2024 - arxiv.org
In this article, we investigate vision-language models (VLM) as reasoners. The ability to form
abstractions underlies mathematical reasoning, problem-solving, and other Math AI tasks …

Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos

H Alzayer, Z Xia, X Zhang, E Shechtman… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose a generative model that, given a coarsely edited image, synthesizes a
photorealistic output that follows the prescribed layout. Our method transfers fine details from …

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models

Z Wu, Y Rubanova, R Kabra, DA Hudson… - arXiv preprint arXiv …, 2024 - arxiv.org
We address the problem of multi-object 3D pose control in image diffusion models. Instead
of conditioning on a sequence of text tokens, we propose to use a set of per-object …

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Q Zhang, Y Xu, C Wang, HY Lee, G Wetzstein… - arXiv preprint arXiv …, 2024 - arxiv.org
Scene image editing is crucial for entertainment, photography, and advertising design.
Existing methods solely focus on either 2D individual object or 3D global scene editing. This …

DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation

S Shen, J Xu, Y Yuan, X Yang, Q Shen… - arXiv preprint arXiv …, 2024 - arxiv.org
User-friendly 3D object editing is a challenging task that has attracted significant attention
recently. The limitations of direct 3D object editing without 2D prior knowledge have …

VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors

V Edited - openreview.net
Generative methods for image and video editing use generative models as priors to perform
edits despite incomplete information, such as changing the composition of 3D objects shown …