PWC-Net: CNNs for optical flow using pyramid, warping, and cost volume

S Karim, G Tong, J Li, A Qadir, U Farooq, Y Yu - Information Fusion, 2023 - Elsevier

Multiple imaging modalities can be combined to provide more information about the real
world than a single modality alone. Infrared images discriminate targets with respect to their …

Cited by 125 Related articles All 2 versions

[PDF] arxiv.org

Deepfakes generation and detection: State-of-the-art, open challenges, countermeasures, and way forward

M Masood, M Nawaz, KM Malik, A Javed, A Irtaza… - Applied …, 2023 - Springer

Easy access to audio-visual content on social media, combined with the availability of
modern tools such as Tensorflow or Keras, and open-source trained models, along with …

Cited by 386 Related articles All 11 versions

[PDF] arxiv.org

CoTracker: It is better to track together

N Karaev, I Rocco, B Graham, N Neverova… - … on Computer Vision, 2025 - Springer

We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …

Cited by 175 Related articles All 2 versions

[PDF] thecvf.com

Tracking everything everywhere all at once

Q Wang, YY Chang, R Cai, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …

Cited by 138 Related articles All 5 versions

[PDF] thecvf.com

PointOdyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Cited by 104 Related articles All 5 versions

[PDF] arxiv.org

FlowFormer: A transformer architecture for optical flow

Z Huang, X Shi, C Zhang, Q Wang, KC Cheung… - European conference on …, 2022 - Springer

We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …

Cited by 313 Related articles All 5 versions

[PDF] thecvf.com

GMFlow: Learning optical flow via global matching

H Xu, J Zhang, J Cai… - Proceedings of the …, 2022 - openaccess.thecvf.com

Learning-based optical flow estimation has been dominated with the pipeline of cost volume
with convolutions for flow regression, which is inherently limited to local correlations and …

Cited by 394 Related articles All 8 versions

[PDF] arxiv.org

Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Cited by 184 Related articles All 15 versions

[PDF] openreview.net

Perceiver IO: A general architecture for structured inputs & outputs

A Jaegle, S Borgeaud, JB Alayrac, C Doersch… - arXiv preprint arXiv …, 2021 - arxiv.org

A central goal of machine learning is the development of systems that can solve many
problems in as many data domains as possible. Current architectures, however, cannot be …

Cited by 607 Related articles All 4 versions

[PDF] thecvf.com

FlowFormer++: Masked cost volume autoencoding for pretraining optical flow estimation

X Shi, Z Huang, D Li, M Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …

Cited by 95 Related articles All 5 versions
