Image sculpting: Precise object editing with 3D geometry control

Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

X Fan, A Bhattad, R Krishna - arXiv preprint arXiv:2403.14617, 2024 - arxiv.org

We introduce Videoshop, a training-free video editing algorithm for localized semantic edits.
Videoshop allows users to use any editing software, including Photoshop and generative …

Cited by 2 · Related articles · All 2 versions

[PDF] arxiv.org

Smart Vision-Language Reasoners

D Roberts, L Roberts - arXiv preprint arXiv:2407.04212, 2024 - arxiv.org

In this article, we investigate vision-language models (VLM) as reasoners. The ability to form
abstractions underlies mathematical reasoning, problem-solving, and other Math AI tasks …

Cited by 1 · Related articles

[PDF] arxiv.org

Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos

H Alzayer, Z Xia, X Zhang, E Shechtman… - arXiv preprint arXiv …, 2024 - arxiv.org

We propose a generative model that, given a coarsely edited image, synthesizes a
photorealistic output that follows the prescribed layout. Our method transfers fine details from …

Cited by 5 · Related articles · All 2 versions

[PDF] arxiv.org

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models

Z Wu, Y Rubanova, R Kabra, DA Hudson… - arXiv preprint arXiv …, 2024 - arxiv.org

We address the problem of multi-object 3D pose control in image diffusion models. Instead
of conditioning on a sequence of text tokens, we propose to use a set of per-object …

Cited by 1 · Related articles · All 2 versions

[PDF] arxiv.org

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Q Zhang, Y Xu, C Wang, HY Lee, G Wetzstein… - arXiv preprint arXiv …, 2024 - arxiv.org

Scene image editing is crucial for entertainment, photography, and advertising design.
Existing methods solely focus on either 2D individual object or 3D global scene editing. This …

Cited by 2 · Related articles · All 3 versions

[PDF] arxiv.org

DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation

S Shen, J Xu, Y Yuan, X Yang, Q Shen… - arXiv preprint arXiv …, 2024 - arxiv.org

User-friendly 3D object editing is a challenging task that has attracted significant attention
recently. The limitations of direct 3D object editing without 2D prior knowledge have …

VIDEOHANDLES: EDITING 3D OBJECT COMPOSITIONS IN VIDEOS USING VIDEO GENERATIVE PRIORS

V Edited - openreview.net

Generative methods for image and video editing use generative models as priors to perform
edits despite incomplete information, such as changing the composition of 3D objects shown …
