Anydoor: Zero-shot object-level image customization

X Chen, L Huang, Y Liu, Y Shen… - Proceedings of the …, 2024 - openaccess.thecvf.com

This work presents AnyDoor, a diffusion-based image generator with the power to teleport
target objects to new scenes at user-specified locations with desired shapes. Instead of …

Cited by 191 Related articles All 3 versions

[PDF] thecvf.com

Animate anyone: Consistent and controllable image-to-video synthesis for character animation

L Hu - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Character Animation aims to generate character videos from still images through driving
signals. Currently, diffusion models have become the mainstream in visual generation …

Cited by 233 Related articles All 3 versions

[PDF] thecvf.com

Videobooth: Diffusion-based video generation with image prompts

Y Jiang, T Wu, S Yang, C Si, D Lin… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-driven video generation witnesses rapid progress. However, merely using text prompts
is not enough to depict the desired subject appearance that accurately aligns with users' …

Cited by 37 Related articles All 4 versions

[PDF] arxiv.org

Video understanding with large language models: A survey

Y Tang, J Bi, S Xu, L Song, S Liang, T Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

With the burgeoning growth of online video platforms and the escalating volume of video
content, the demand for proficient video understanding tools has intensified markedly. Given …

Cited by 47 Related articles All 2 versions

[PDF] arxiv.org

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Cited by 61 Related articles All 2 versions

[PDF] thecvf.com

Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry... for now

A Sarkar, H Mai, A Mahapatra… - Proceedings of the …, 2024 - openaccess.thecvf.com

Generative models can produce impressively realistic images. This paper demonstrates that
generated images have geometric features different from those of real images. We build a …

Cited by 24 Related articles All 4 versions

[PDF] thecvf.com

Dl3dv-10k: A large-scale scene dataset for deep learning-based 3d vision

L Ling, Y Sheng, Z Tu, W Zhao, C Xin… - Proceedings of the …, 2024 - openaccess.thecvf.com

We have witnessed significant progress in deep learning-based 3D vision ranging from
neural radiance field (NeRF) based 3D representation learning to applications in novel view …

Cited by 41 Related articles All 4 versions

[PDF] thecvf.com

Imprint: Generative object compositing by learning identity-preserving representation

Y Song, Z Zhang, Z Lin, S Cohen… - Proceedings of the …, 2024 - openaccess.thecvf.com

Generative object compositing emerges as a promising new avenue for compositional
image editing. However, the requirement of object identity preservation poses a significant …

Cited by 16 Related articles All 3 versions

[PDF] acm.org

Llmr: Real-time prompting of interactive worlds using large language models

F De La Torre, CM Fang, H Huang… - Proceedings of the CHI …, 2024 - dl.acm.org

We present Large Language Model for Mixed Reality (LLMR), a framework for the real-time
creation and modification of interactive Mixed Reality experiences using LLMs. LLMR …

Cited by 32 Related articles All 4 versions

[PDF] thecvf.com

Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D

K Pandey, P Guerrero, M Gadelha… - Proceedings of the …, 2024 - openaccess.thecvf.com

Diffusion Handles is a novel approach to enable 3D object edits on diffusion images,
requiring only existing pre-trained diffusion models and depth estimation, without any fine-tuning …

Cited by 12 Related articles All 3 versions
