Improved conditional vrnns for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org

The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

被引用次数：264 相关文章所有 14 个版本

Teleoperation methods and enhancement techniques for mobile robots: A comprehensive survey

MD Moniruzzaman, A Rassau, D Chai… - Robotics and Autonomous …, 2022 - Elsevier

In a world with rapidly growing levels of automation, robotics is playing an increasingly
significant role in every aspect of human endeavour. In particular, many types of mobile …

被引用次数：68 相关文章所有 3 个版本

[PDF] thecvf.com

Align your latents: High-resolution video synthesis with latent diffusion models

A Blattmann, R Rombach, H Ling… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …

被引用次数：482 相关文章所有 6 个版本

[PDF] arxiv.org

Stable video diffusion: Scaling latent video diffusion models to large datasets

A Blattmann, T Dockhorn, S Kulal… - arXiv preprint arXiv …, 2023 - arxiv.org

We present Stable Video Diffusion-a latent video diffusion model for high-resolution, state-of-
the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained …

被引用次数：207 相关文章所有 2 个版本

[PDF] neurips.cc

Mcvd-masked conditional video diffusion for prediction, generation, and interpolation

V Voleti, A Jolicoeur-Martineau… - Advances in neural …, 2022 - proceedings.neurips.cc

Video prediction is a challenging task. The quality of video frames from current state-of-the-
art (SOTA) generative models tends to be poor and generalization beyond the training data …

被引用次数：180 相关文章所有 9 个版本

[PDF] thecvf.com

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

被引用次数：155 相关文章所有 6 个版本

[PDF] neurips.cc

Gaudi: A neural architect for immersive 3d scene generation

MA Bautista, P Guo, S Abnar… - Advances in …, 2022 - proceedings.neurips.cc

We introduce GAUDI, a generative model capable of capturing the distribution of complex
and realistic 3D scenes that can be rendered immersively from a moving camera. We tackle …

被引用次数：105 相关文章所有 6 个版本

[PDF] thecvf.com

Simda: Simple diffusion adapter for efficient video generation

Z Xing, Q Dai, H Hu, Z Wu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

The recent wave of AI-generated content has witnessed the great development and success
of Text-to-Image (T2I) technologies. By contrast Text-to-Video (T2V) still falls short of …

被引用次数：33 相关文章所有 3 个版本

[PDF] mdpi.com

Diffusion probabilistic modeling for video generation

R Yang, P Srivastava, S Mandt - Entropy, 2023 - mdpi.com

Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …

被引用次数：180 相关文章所有 9 个版本

[PDF] arxiv.org

Predrnn: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

被引用次数：296 相关文章所有 6 个版本

高级搜索

QQ 群