Extracting motion and appearance via inter-frame attention for efficient video frame interpolation

G Zhang, Y Zhu, H Wang, Y Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Effectively extracting inter-frame motion and appearance information is important for video
frame interpolation (VFI). Previous works either extract both types of information in a mixed …

Generative image dynamics

Z Li, R Tucker, N Snavely… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present an approach to modeling an image-space prior on scene motion. Our prior is
learned from a collection of motion trajectories extracted from real video sequences …

VideoFlow: Exploiting temporal cues for multi-frame optical flow estimation

X Shi, Z Huang, W Bian, D Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to
previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently …

AMT: All-pairs multi-field transforms for efficient frame interpolation

Z Li, ZL Zhu, LH Han, Q Hou… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for
video frame interpolation. It is based on two essential designs. First, we build bidirectional …

OpenSTL: A comprehensive benchmark of spatio-temporal predictive learning

C Tan, S Li, Z Gao, W Guan, Z Wang… - Advances in …, 2023 - proceedings.neurips.cc
Spatio-temporal predictive learning is a learning paradigm that enables models to learn
spatial and temporal patterns by predicting future frames from given past frames in an …

DynamiCrafter: Animating open-domain images with video diffusion priors

J Xing, M Xia, Y Zhang, H Chen, X Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Animating a still image offers an engaging visual experience. Traditional image animation
techniques mainly focus on animating natural scenes with stochastic dynamics (e.g., clouds …

MMVP: Motion-matrix-based video prediction

Y Zhong, L Liang, I Zharkov… - Proceedings of the …, 2023 - openaccess.thecvf.com
A central challenge of video prediction is that the system must reason about an object's
future motion from image frames while simultaneously maintaining the consistency of its …

VMRNN: Integrating Vision Mamba and LSTM for efficient and accurate spatiotemporal forecasting

Y Tang, P Dong, Z Tang, X Chu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Combining Convolutional Neural Networks (CNNs) or Vision Transformers (ViTs)
with Recurrent Neural Networks (RNNs) for spatiotemporal forecasting has yielded …

LAMP: Learn A Motion Pattern for Few-Shot Video Generation

R Wu, L Chen, T Yang, C Guo, C Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper, we present LAMP, a few-shot text-to-video framework that enables a
text-to-image diffusion model to Learn A specific Motion Pattern with 8–16 videos on a single GPU …