A comprehensive survey on segment anything model for vision and beyond

C Zhang, L Liu, Y Cui, G Huang, W Lin, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …

On distillation of guided diffusion models

C Meng, R Rombach, R Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Classifier-free guided diffusion models have recently been shown to be highly effective at
high-resolution image generation, and they have been widely used in large-scale diffusion …

Paint by example: Exemplar-based image editing with diffusion models

B Yang, S Gu, B Zhang, T Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Language-guided image editing has achieved great success recently. In this paper,
we investigate exemplar-guided image editing for more precise control. We achieve this …

Decomposing nerf for editing via feature field distillation

S Kobayashi, E Matsumoto… - Advances in Neural …, 2022 - proceedings.neurips.cc
Emerging neural radiance fields (NeRF) are a promising scene representation for computer
graphics, enabling high-quality 3D reconstruction and novel view synthesis from image …

Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models

R Huang, J Huang, D Yang, Y Ren… - International …, 2023 - proceedings.mlr.press
Large-scale multimodal generative modeling has created milestones in text-to-image and
text-to-video generation. Its application to audio still lags behind for two main reasons: the …

Improving diffusion models for inverse problems using manifold constraints

H Chung, B Sim, D Ryu, JC Ye - Advances in Neural …, 2022 - proceedings.neurips.cc
Recently, diffusion models have been used to solve various inverse problems in an
unsupervised manner with appropriate modifications to the sampling process. However, the …

Repaint: Inpainting using denoising diffusion probabilistic models

A Lugmayr, M Danelljan, A Romero… - Proceedings of the …, 2022 - openaccess.thecvf.com
Free-form inpainting is the task of adding new content to an image in the regions specified
by an arbitrary binary mask. Most existing approaches train for a certain distribution of …

Diffrf: Rendering-guided 3d radiance field diffusion

N Müller, Y Siddiqui, L Porzi, SR Bulo… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce DiffRF, a novel approach for 3D radiance field synthesis based on denoising
diffusion probabilistic models. While existing diffusion-based methods operate on images …

High-resolution image synthesis with latent diffusion models

R Rombach, A Blattmann, D Lorenz… - Proceedings of the …, 2022 - openaccess.thecvf.com
By decomposing the image formation process into a sequential application of denoising
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …

Mat: Mask-aware transformer for large hole image inpainting

W Li, Z Lin, K Zhou, L Qi, Y Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recent studies have shown the importance of modeling long-range interactions in the
inpainting problem. To achieve this goal, existing approaches exploit either standalone …