Srformer: Permuted self-attention for single image super-resolution

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (eg, SwinIR) can significantly improve the model performance but …

Extracting motion and appearance via inter-frame attention for efficient video frame interpolation

G Zhang, Y Zhu, H Wang, Y Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Effectively extracting inter-frame motion and appearance information is important for video
frame interpolation (VFI). Previous works either extract both types of information in a mixed …

Transformer meets remote sensing video detection and tracking: A comprehensive survey

L Jiao, X Zhang, X Liu, F Liu, S Yang… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Transformer has shown excellent performance in remote sensing field with long-range
modeling capabilities. Remote sensing video (RSV) moving object detection and tracking …

Real-time intermediate flow estimation for video frame interpolation

Z Huang, T Zhang, W Heng, B Shi, S Zhou - European Conference on …, 2022 - Springer
Real-time video frame interpolation (VFI) is very useful in video processing, media players,
and display devices. We propose RIFE, a Real-time Intermediate Flow Estimation algorithm …

Amt: All-pairs multi-field transforms for efficient frame interpolation

Z Li, ZL Zhu, LH Han, Q Hou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for
video frame interpolation. It is based on two essential designs. First, we build bidirectional …

Cycmunet+: Cycle-projected mutual learning for spatial-temporal video super-resolution

M Hu, K Jiang, Z Wang, X Bai… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Spatial-Temporal Video Super-Resolution (ST-VSR) aims to generate high-quality videos
with higher resolution (HR) and higher frame rate (HFR). Quite intuitively, pioneering two …

TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving

S Fang, Z Wang, Y Zhong, J Ge… - Proceedings of the …, 2023 - openaccess.thecvf.com
Vision-centric joint perception and prediction (PnP) has become an emerging trend in
autonomous driving research. It predicts the future states of the traffic participants in the …

A unified pyramid recurrent network for video frame interpolation

X Jin, L Wu, J Chen, Y Chen, J Koo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Flow-guided synthesis provides a common framework for frame interpolation, where optical
flow is estimated to guide the synthesis of intermediate frames between consecutive inputs …

Event-based video frame interpolation with cross-modal asymmetric bidirectional motion fields

T Kim, Y Chae, HK Jang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Video Frame Interpolation (VFI) aims to generate intermediate video frames
between consecutive input frames. Since the event cameras are bio-inspired sensors that …

Deep geometrized cartoon line inbetweening

L Siyao, T Gu, W Xiao, H Ding… - Proceedings of the …, 2023 - openaccess.thecvf.com
We aim to address a significant but understudied problem in the anime industry, namely the
inbetweening of cartoon line drawings. Inbetweening involves generating intermediate …