Decomposing motion and content for natural video sequence prediction

H Wu, Z Yao, J Wang, M Long - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

This paper tackles video prediction from a new dimension of predicting spacetime-varying
motions that are incessantly changing across both space and time. Prior methods mainly …

被引用次数：139 相关文章所有 6 个版本

[PDF] arxiv.org

Infinitenature-zero: Learning perpetual view generation of natural scenes from single images

Z Li, Q Wang, N Snavely, A Kanazawa - European Conference on …, 2022 - Springer

We present a method for learning to generate unbounded flythrough videos of natural
scenes starting from a single view. This capability is learned from a collection of single …

被引用次数：43 相关文章所有 8 个版本

[PDF] neurips.cc

Mau: A motion-aware unit for video prediction and beyond

Z Chang, X Zhang, S Wang, S Ma… - Advances in …, 2021 - proceedings.neurips.cc

Accurately predicting inter-frame motion information plays a key role in video prediction
tasks. In this paper, we propose a Motion-Aware Unit (MAU) to capture reliable inter-frame …

被引用次数：77 相关文章所有 5 个版本

[PDF] neurips.cc

Learning to decompose and disentangle representations for video prediction

JT Hsieh, B Liu, DA Huang… - Advances in neural …, 2018 - proceedings.neurips.cc

Our goal is to predict future video frames given a sequence of input frames. Despite large
amounts of video data, this remains a challenging task because of the high-dimensionality of …

被引用次数：342 相关文章所有 6 个版本

[PDF] arxiv.org

Video coding for machines: A paradigm of collaborative compression and intelligent analytics

L Duan, J Liu, W Yang, T Huang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Video coding, which targets to compress and reconstruct the whole frame, and feature
compression, which only preserves and transmits the most critical information, stand at two …

被引用次数：212 相关文章所有 10 个版本

[PDF] mlr.press

Learning to generate long-term future via hierarchical prediction

R Villegas, J Yang, Y Zou, S Sohn… - … on machine learning, 2017 - proceedings.mlr.press

We propose a hierarchical approach for making long-term predictions of future frames. To
avoid inherent compounding errors in recursive pixel-level prediction, we propose to first …

被引用次数：436 相关文章所有 8 个版本

[PDF] thecvf.com

Video prediction recalling long-term motion context via memory alignment learning

S Lee, HG Kim, DH Choi, HI Kim… - Proceedings of the …, 2021 - openaccess.thecvf.com

Our work addresses long-term motion context issues for predicting future frames. To predict
the future precisely, it is required to capture which long-term motion context (eg, walking or …

被引用次数：110 相关文章所有 10 个版本

[PDF] arxiv.org

Adversarial video generation on complex datasets

A Clark, J Donahue, K Simonyan - arXiv preprint arXiv:1907.06571, 2019 - arxiv.org

Generative models of natural images have progressed towards high fidelity samples by the
strong leveraging of scale. We attempt to carry this success to the field of video modeling by …

被引用次数：209 相关文章所有 4 个版本

基于计算机视觉的Transformer 研究进展.

刘文婷，卢新明 - Journal of Computer Engineering & …, 2022 - search.ebscohost.com

Transformer 是一种基于自注意力机制, 并行化处理数据的深度神经网络. 近几年基于
Transformer 的模型成为计算机视觉任务的重要研究方向. 针对目前国内基于Transformer …

被引用次数：22 相关文章所有 3 个版本

Video anomaly detection with sparse coding inspired deep neural networks

W Luo, W Liu, D Lian, J Tang, L Duan… - IEEE transactions on …, 2019 - ieeexplore.ieee.org

This paper presents an anomaly detection method that is based on a sparse coding inspired
Deep Neural Networks (DNN). Specifically, in light of the success of sparse coding based …

被引用次数：199 相关文章所有 5 个版本

高级搜索

QQ 群