In this work we present Vlogger a generic AI system for generating a minute-level video blog (ie vlog) of user descriptions. Different from short videos with a few seconds vlog often …
It is desirable but challenging to generate content-rich long videos in the scale of minutes. Autoregressive large language models (LLMs) have achieved great success in generating …
Image-to-video (I2V) generation aims to use the initial frame (alongside a text prompt) to create a video sequence. A grand challenge in I2V generation is to maintain visual …
We present ART-V an efficient framework for auto-regressive video generation with diffusion models. Unlike existing methods that generate entire videos in one-shot ART-V generates a …
Recent advancements in video generation have primarily leveraged diffusion models for short-duration content. However, these approaches often fall short in modeling complex …
Video diffusion models have made substantial progress in various video generation applications. However, training models for long video generation tasks require significant …
Text-to-video diffusion models enable the generation of high-quality videos that follow text instructions, making it easy to create diverse and individual content. However, existing …
Comprehensive and constructive evaluation protocols play an important role in the development of sophisticated text-to-video (T2V) generation models. Existing evaluation …
Diffusion model has demonstrated remarkable capability in video generation, which further sparks interest in introducing trajectory control into the generation process. While existing …