Decomposing motion and content for natural video sequence prediction

W Byeon, Q Wang, RK Srivastava… - Proceedings of the …, 2018 - openaccess.thecvf.com

Video prediction models based on convolutional networks, recurrent networks, and their
combinations often result in blurry predictions. We identify an important contributing factor for …

被引用次数：176 相关文章所有 10 个版本

[PDF] thecvf.com

Imaginator: Conditional spatio-temporal gan for video generation

Y Wang, P Bilinski, F Bremond… - Proceedings of the …, 2020 - openaccess.thecvf.com

Generating human videos based on single images entails the challenging simultaneous
generation of realistic and visual appealing appearance and motion. In this context, we …

被引用次数：114 相关文章所有 9 个版本

[PDF] thecvf.com

S3vae: Self-supervised sequential vae for representation disentanglement and data generation

Y Zhu, MR Min, A Kadav… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

We propose a sequential variational autoencoder to learn disentangled representations of
sequential data (eg, videos and audios) under self-supervision. Specifically, we exploit the …

被引用次数：106 相关文章所有 7 个版本

[PDF] thecvf.com

Dnerv: Modeling inherent dynamics via difference neural representation for videos

Q Zhao, MS Asif, Z Ma - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com

Existing implicit neural representation (INR) methods do not fully exploit spatiotemporal
redundancies in videos. Index-based INRs ignore the content-specific spatial features and …

被引用次数：14 相关文章所有 5 个版本

[PDF] arxiv.org

Convtransformer: A convolutional transformer network for video frame synthesis

Z Liu, S Luo, W Li, J Lu, Y Wu, S Sun, C Li… - arXiv preprint arXiv …, 2020 - arxiv.org

Deep Convolutional Neural Networks (CNNs) are powerful models that have achieved
excellent performance on difficult computer vision tasks. Although CNNs perform well …

被引用次数：83 相关文章所有 2 个版本

[PDF] thecvf.com

Flow-grounded spatial-temporal video prediction from still images

Y Li, C Fang, J Yang, Z Wang, X Lu… - Proceedings of the …, 2018 - openaccess.thecvf.com

Existing video prediction methods mainly rely on observing multiple historical frames or
focus on predicting the next one-frame. In this work, we study the problem of generating …

被引用次数：149 相关文章所有 11 个版本

[PDF] researchgate.net

[PDF][PDF] Videoflow: A flow-based generative model for video

M Kumar, M Babaeizadeh, D Erhan… - arXiv preprint arXiv …, 2019 - researchgate.net

Generative models that can model and predict sequences of future events can, in principle,
learn to capture complex real-world phenomena, such as physical interactions. In particular …

被引用次数：142 相关文章所有 2 个版本

[PDF] ijcai.org

[PDF][PDF] Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies.

W Liu, W Luo, Z Li, P Zhao, S Gao - IJCAI, 2019 - ijcai.org

Classical semi-supervised video anomaly detection assumes that only normal data are
available in the training set because of the rare and unbounded nature of anomalies. It is …

被引用次数：114 相关文章所有 5 个版本

[PDF] neurips.cc

Pre-training contextualized world models with in-the-wild videos for reinforcement learning

J Wu, H Ma, C Deng, M Long - Advances in Neural …, 2024 - proceedings.neurips.cc

Unsupervised pre-training methods utilizing large and diverse datasets have achieved
tremendous success across a range of domains. Recent work has investigated such …

被引用次数：10 相关文章所有 7 个版本

[PDF] mlr.press

Hierarchical long-term video prediction without supervision

R Villegas, D Erhan, H Lee - International Conference on …, 2018 - proceedings.mlr.press

Much of recent research has been devoted to video prediction and generation, yet most of
the previous works have demonstrated only limited success in generating videos on short …

被引用次数：153 相关文章所有 7 个版本

高级搜索

QQ 群