Hierarchical long-term video prediction without supervision

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org

The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

被引用次数：258 相关文章所有 17 个版本

Teleoperation methods and enhancement techniques for mobile robots: A comprehensive survey

MD Moniruzzaman, A Rassau, D Chai… - Robotics and Autonomous …, 2022 - Elsevier

In a world with rapidly growing levels of automation, robotics is playing an increasingly
significant role in every aspect of human endeavour. In particular, many types of mobile …

被引用次数：67 相关文章所有 3 个版本

[PDF] neurips.cc

Flexible diffusion modeling of long videos

W Harvey, S Naderiparizi, V Masrani… - Advances in …, 2022 - proceedings.neurips.cc

We present a framework for video modeling based on denoising diffusion probabilistic
models that produces long-duration video completions in a variety of realistic environments …

被引用次数：181 相关文章所有 8 个版本

[PDF] thecvf.com

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

被引用次数：149 相关文章所有 8 个版本

[PDF] mdpi.com

Diffusion probabilistic modeling for video generation

R Yang, P Srivastava, S Mandt - Entropy, 2023 - mdpi.com

Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …

被引用次数：174 相关文章所有 10 个版本

[PDF] arxiv.org

Predrnn: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

被引用次数：285 相关文章所有 6 个版本

[PDF] thecvf.com

Tdan: Temporally-deformable alignment network for video super-resolution

Y Tian, Y Zhang, Y Fu, C Xu - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com

Video super-resolution (VSR) aims to restore a photo-realistic high-resolution (HR) video
frame from both its corresponding low-resolution (LR) frame (reference frame) and multiple …

被引用次数：606 相关文章所有 13 个版本

[PDF] thecvf.com

Hierarchical cross-modal talking face generation with dynamic pixel-wise loss

L Chen, RK Maddox, Z Duan… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

We devise a cascade GAN approach to generate talking face video, which is robust to
different face shapes, view angles, facial characteristics, and noisy audio conditions. Instead …

被引用次数：385 相关文章所有 12 个版本

[PDF] openreview.net

Eidetic 3D LSTM: A model for video prediction and beyond

Y Wang, L Jiang, MH Yang, LJ Li, M Long… - International …, 2018 - openreview.net

Spatiotemporal predictive learning, though long considered to be a promising self-
supervised feature learning method, seldom shows its effectiveness beyond future video …

被引用次数：397 相关文章所有 11 个版本

[PDF] thecvf.com

Greedy hierarchical variational autoencoders for large-scale video prediction

B Wu, S Nair, R Martin-Martin… - Proceedings of the …, 2021 - openaccess.thecvf.com

A video prediction model that generalizes to diverse scenes would enable intelligent agents
such as robots to perform a variety of tasks via planning with the model. However, while …

被引用次数：114 相关文章所有 6 个版本

高级搜索

QQ 群