As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines everywhere because of its ability to analyze and create text, images, and beyond. With such …
We propose a method for editing images from human instructions: given an input image and a written instruction that tells the model what to do, our model follows these instructions to …
S Chen, P Sun, Y Song, P Luo - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
Large-scale diffusion-based generative models have led to breakthroughs in text- conditioned high-resolution image synthesis. Starting from random noise, such text-to-image …
This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially …
Abstract We introduce Inference-Time Intervention (ITI), a technique designed to enhance the" truthfulness" of large language models (LLMs). ITI operates by shifting model activations …
This paper presents a 3D diffusion model that automatically generates 3D digital avatars represented as neural radiance fields (NeRFs). A significant challenge for 3D diffusion is …
X Liu, C Gong, Q Liu - arXiv preprint arXiv:2209.03003, 2022 - arxiv.org
We present rectified flow, a surprisingly simple approach to learning (neural) ordinary differential equation (ODE) models to transport between two empirically observed …
Deep generative models have unlocked another profound realm of human creativity. By capturing and generalizing patterns within data, we have entered the epoch of all …