Probing the 3d awareness of visual foundation models

M El Banani, A Raj, KK Maninis, A Kar… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advances in large-scale pretraining have yielded visual foundation models with
strong capabilities. Not only can recent models generalize to arbitrary images for their …

Consistent3d: Towards consistent high-fidelity text-to-3d generation with deterministic sampling prior

Z Wu, P Zhou, X Yi, X Yuan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Score distillation sampling (SDS) and its variants have greatly boosted the development of
text-to-3D generation but are vulnerable to geometry collapse and poor textures yet. To …

Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry... for now

A Sarkar, H Mai, A Mahapatra… - Proceedings of the …, 2024 - openaccess.thecvf.com
Generative models can produce impressively realistic images. This paper demonstrates that
generated images have geometric features different from those of real images. We build a …

Beta diffusion

M Zhou, T Chen, Z Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc
We introduce beta diffusion, a novel generative modeling method that integrates demasking
and denoising to generate data within bounded ranges. Using scaled and shifted beta …

Generative models: What do they know? do they know things? let's find out!

X Du, N Kolkin, G Shakhnarovich, A Bhattad - arXiv preprint arXiv …, 2023 - arxiv.org
Generative models excel at mimicking real scenes, suggesting they might inherently encode
important intrinsic scene properties. In this paper, we aim to explore the following key …

Editanything: Empowering unparalleled flexibility in image editing and generation

S Gao, Z Lin, X Xie, P Zhou, MM Cheng… - Proceedings of the 31st …, 2023 - dl.acm.org
Image editing plays a vital role in computer vision field, aiming to realistically manipulate
images while ensuring seamless integration. It finds numerous applications across various …

Boosting Diffusion Models with Moving Average Sampling in Frequency Domain

Y Qian, Q Cai, Y Pan, Y Li, T Yao… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models have recently brought a powerful revolution in image generation. Despite
showing impressive generative capabilities most of these models rely on the current sample …

Emergence of hidden capabilities: Exploring learning dynamics in concept space

CF Park, M Okawa, A Lee, ES Lubana… - arXiv preprint arXiv …, 2024 - arxiv.org
Modern generative models demonstrate impressive capabilities, likely stemming from an
ability to identify and manipulate abstract concepts underlying their training data. However …

Doubly Abductive Counterfactual Inference for Text-based Image Editing

X Song, J Cui, H Zhang, J Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
We study text-based image editing (TBIE) of a single image by counterfactual inference
because it is an elegant formulation to precisely address the requirement: the edited image …

A closer look at time steps is worthy of triple speed-up for diffusion model training

K Wang, M Shi, Y Zhou, Z Li, Z Yuan, Y Shang… - arXiv preprint arXiv …, 2024 - arxiv.org
Training diffusion models is always a computation-intensive task. In this paper, we introduce
a novel speed-up method for diffusion model training, called, which is based on a closer …