Video frame interpolation with transformer

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com

Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (eg, SwinIR) can significantly improve the model performance but …

被引用次数：153 相关文章所有 5 个版本

[PDF] thecvf.com

Extracting motion and appearance via inter-frame attention for efficient video frame interpolation

G Zhang, Y Zhu, H Wang, Y Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com

Effectively extracting inter-frame motion and appearance information is important for video
frame interpolation (VFI). Previous works either extract both types of information in a mixed …

被引用次数：109 相关文章所有 6 个版本

[PDF] ieee.org

Transformer meets remote sensing video detection and tracking: A comprehensive survey

L Jiao, X Zhang, X Liu, F Liu, S Yang… - IEEE Journal of …, 2023 - ieeexplore.ieee.org

Transformer has shown excellent performance in remote sensing field with long-range
modeling capabilities. Remote sensing video (RSV) moving object detection and tracking …

被引用次数：25 相关文章所有 2 个版本

[PDF] arxiv.org

Real-time intermediate flow estimation for video frame interpolation

Z Huang, T Zhang, W Heng, B Shi, S Zhou - European Conference on …, 2022 - Springer

Real-time video frame interpolation (VFI) is very useful in video processing, media players,
and display devices. We propose RIFE, a Real-time Intermediate Flow Estimation algorithm …

被引用次数：390 相关文章所有 7 个版本

[PDF] thecvf.com

Amt: All-pairs multi-field transforms for efficient frame interpolation

Z Li, ZL Zhu, LH Han, Q Hou… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for
video frame interpolation. It is based on two essential designs. First, we build bidirectional …

被引用次数：81 相关文章所有 6 个版本

Cycmunet+: Cycle-projected mutual learning for spatial-temporal video super-resolution

M Hu, K Jiang, Z Wang, X Bai… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Spatial-Temporal Video Super-Resolution (ST-VSR) aims to generate high-quality videos
with higher resolution (HR) and higher frame rate (HFR). Quite intuitively, pioneering two …

被引用次数：29 相关文章所有 6 个版本

[PDF] thecvf.com

TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving

S Fang, Z Wang, Y Zhong, J Ge… - Proceedings of the …, 2023 - openaccess.thecvf.com

Vision-centric joint perception and prediction (PnP) has become an emerging trend in
autonomous driving research. It predicts the future states of the traffic participants in the …

被引用次数：26 相关文章所有 6 个版本

[PDF] thecvf.com

A unified pyramid recurrent network for video frame interpolation

X Jin, L Wu, J Chen, Y Chen, J Koo… - Proceedings of the …, 2023 - openaccess.thecvf.com

Flow-guided synthesis provides a common framework for frame interpolation, where optical
flow is estimated to guide the synthesis of intermediate frames between consecutive inputs …

被引用次数：50 相关文章所有 5 个版本

[PDF] thecvf.com

Event-based video frame interpolation with cross-modal asymmetric bidirectional motion fields

T Kim, Y Chae, HK Jang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Abstract Video Frame Interpolation (VFI) aims to generate intermediate video frames
between consecutive input frames. Since the event cameras are bio-inspired sensors that …

被引用次数：23 相关文章所有 4 个版本

[PDF] thecvf.com

Deep geometrized cartoon line inbetweening

L Siyao, T Gu, W Xiao, H Ding… - Proceedings of the …, 2023 - openaccess.thecvf.com

We aim to address a significant but understudied problem in the anime industry, namely the
inbetweening of cartoon line drawings. Inbetweening involves generating intermediate …

被引用次数：13 相关文章所有 5 个版本

高级搜索

QQ 群