MoQuad: motion-focused quadruple construction for video contrastive learning

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

MoQuad: motion-focused quadruple construction for video contrastive learning

在引用文章中搜索

[PDF] thecvf.com

Improving pixel-based mim by reducing wasted modeling capability

Y Liu, S Zhang, J Chen, Z Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com

There has been significant progress in Masked Image Modeling (MIM). Existing MIM
methods can be broadly categorized into two groups based on the reconstruction target …

被引用次数：19 相关文章所有 6 个版本

[PDF] arxiv.org

Pixmim: Rethinking pixel reconstruction in masked image modeling

Y Liu, S Zhang, J Chen, K Chen, D Lin - arXiv preprint arXiv:2303.02416, 2023 - arxiv.org

Masked Image Modeling (MIM) has achieved promising progress with the advent of Masked
Autoencoders (MAE) and BEiT. However, subsequent works have complicated the …

被引用次数：20 相关文章所有 3 个版本

[PDF] springer.com

Controllable augmentations for video representation learning

R Qian, W Lin, J See, D Li - Visual Intelligence, 2024 - Springer

This paper focuses on self-supervised video representation learning. Most existing
approaches follow the contrastive learning pipeline to construct positive and negative pairs …

被引用次数：3 相关文章所有 7 个版本

高级搜索

QQ 群

MoQuad: motion-focused quadruple construction for video contrastive learning

Improving pixel-based mim by reducing wasted modeling capability

Pixmim: Rethinking pixel reconstruction in masked image modeling

Controllable augmentations for video representation learning

引用