Frame-wise action representations for long videos via sequence contrastive learning

H Yan, Y Liu, Y Wei, Z Li, G Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Skeleton sequence representation learning has shown great advantages for action
recognition due to its promising ability to model human joints and topology. However, the …

被引用次数：21 相关文章所有 5 个版本

[PDF] neurips.cc

Learning fine-grained view-invariant representations from unpaired ego-exo videos via temporal alignment

ZS Xue, K Grauman - Advances in Neural Information …, 2023 - proceedings.neurips.cc

The egocentric and exocentric viewpoints of a human activity look dramatically different, yet
invariant representations to link them are essential for many potential applications in …

被引用次数：12 相关文章所有 6 个版本

[PDF] thecvf.com

Modeling video as stochastic processes for fine-grained video representation learning

H Zhang, D Liu, Q Zheng, B Su - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

A meaningful video is semantically coherent and changes smoothly. However, most existing
fine-grained video representation learning methods learn frame-wise features by aligning …

被引用次数：10 相关文章所有 4 个版本

[PDF] thecvf.com

Weakly supervised video representation learning with unaligned text for sequential videos

S Dong, H Hu, D Lian, W Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

Sequential video understanding, as an emerging video understanding task, has driven lots
of researchers' attention because of its goal-oriented nature. This paper studies weakly …

被引用次数：7 相关文章所有 5 个版本

[PDF] neurips.cc

Learning viewpoint-agnostic visual representations by recovering tokens in 3d space

J Shang, S Das, M Ryoo - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Humans are remarkably flexible in understanding viewpoint changes due to visual cortex
supporting the perception of 3D structure. In contrast, most of the computer vision models …

被引用次数：11 相关文章所有 7 个版本

[PDF] arxiv.org

Universal time-series representation learning: A survey

P Trirat, Y Shin, J Kang, Y Nam, J Na, M Bae… - arXiv preprint arXiv …, 2024 - arxiv.org

Time-series data exists in every corner of real-world systems and services, ranging from
satellites in the sky to wearable devices on human bodies. Learning representations by …

被引用次数：3 相关文章所有 2 个版本

[PDF] mlr.press

Context consistency regularization for label sparsity in time series

Y Shin, S Yoon, H Song, D Park, B Kim… - International …, 2023 - proceedings.mlr.press

Labels are typically sparse in real-world time series due to the high annotation cost.
Recently, consistency regularization techniques have been used to generate artificial labels …

被引用次数：3 相关文章所有 5 个版本

Solution of wide and micro background bias in contrastive action representation learning

S Liu, Z Luo, Y Li, Y Wang, W Fu, W Ding - Engineering Applications of …, 2024 - Elsevier

In recent years, contrastive learning has made great progress in the field of computer vision,
which shows great potential in action representation learning. Current contrastive learning …

被引用次数：2 相关文章

[PDF] openreview.net

Conditional Information Bottleneck Approach for Time Series Imputation

MG Choi, C Lee - The Twelfth International Conference on Learning …, 2023 - openreview.net

Time series imputation presents a significant challenge because it requires capturing the
underlying temporal dynamics from partially observed time series data. Among the recent …

被引用次数：2 相关文章

[PDF] mdpi.com

Action Detection for Wildlife Monitoring with Camera Traps Based on Segmentation with Filtering of Tracklets (SWIFT) and Mask-Guided Action Recognition …

F Schindler, V Steinhage, STS van Beeck Calkoen… - Applied Sciences, 2024 - mdpi.com

Behavioral analysis of animals in the wild plays an important role for ecological research
and conservation and has been mostly performed by researchers. We introduce an action …

被引用次数：1 相关文章所有 5 个版本

高级搜索

QQ 群