Connectionist temporal modeling for weakly supervised action labeling

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

被引用次数：34 相关文章所有 8 个版本

[PDF] arxiv.org

Deep learning-based action detection in untrimmed videos: A survey

E Vahdani, Y Tian - IEEE Transactions on Pattern Analysis and …, 2022 - ieeexplore.ieee.org

Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …

被引用次数：55 相关文章所有 8 个版本

[PDF] thecvf.com

Howto100m: Learning a text-video embedding by watching hundred million narrated video clips

A Miech, D Zhukov, JB Alayrac… - Proceedings of the …, 2019 - openaccess.thecvf.com

Learning text-video embeddings usually requires a dataset of video clips with manually
provided captions. However, such datasets are expensive and time consuming to create and …

被引用次数：1116 相关文章所有 10 个版本

[PDF] springer.com

Rescaling egocentric vision: Collection, pipeline and challenges for epic-kitchens-100

D Damen, H Doughty, GM Farinella, A Furnari… - International Journal of …, 2022 - Springer

This paper introduces the pipeline to extend the largest dataset in egocentric vision, EPIC-
KITCHENS. The effort culminates in EPIC-KITCHENS-100, a collection of 100 hours, 20M …

被引用次数：424 相关文章所有 13 个版本

[PDF] thecvf.com

Ms-tcn: Multi-stage temporal convolutional network for action segmentation

YA Farha, J Gall - Proceedings of the IEEE/CVF conference …, 2019 - openaccess.thecvf.com

Temporally locating and classifying action segments in long untrimmed videos is of
particular interest to many applications like surveillance and robotics. While traditional …

被引用次数：641 相关文章所有 10 个版本

[PDF] arxiv.org

Unified fully and timestamp supervised temporal action segmentation via sequence to sequence translation

N Behrmann, SA Golestaneh, Z Kolter, J Gall… - European conference on …, 2022 - Springer

This paper introduces a unified framework for video action segmentation via sequence to
sequence (seq2seq) translation in a fully and timestamp supervised setup. In contrast to …

被引用次数：67 相关文章所有 4 个版本

[PDF] arxiv.org

Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer

Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

被引用次数：659 相关文章所有 6 个版本

[PDF] thecvf.com

Neural sign language translation

NC Camgoz, S Hadfield, O Koller… - Proceedings of the …, 2018 - openaccess.thecvf.com

Abstract Sign Language Recognition (SLR) has been an active research field for the last two
decades. However, most research to date has considered SLR as a naive gesture …

被引用次数：617 相关文章所有 16 个版本

[PDF] thecvf.com

Temporal convolutional networks for action segmentation and detection

C Lea, MD Flynn, R Vidal, A Reiter… - proceedings of the …, 2017 - openaccess.thecvf.com

The ability to identify and temporally segment fine-grained human actions throughout a
video is crucial for robotics, surveillance, education, and beyond. Typical approaches …

被引用次数：1709 相关文章所有 13 个版本

[PDF] thecvf.com

Hide-and-seek: Forcing a network to be meticulous for weakly-supervised object and action localization

K Kumar Singh, Y Jae Lee - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

Abstract We propose'Hide-and-Seek', a weakly-supervised framework that aims to improve
object localization in images and action localization in videos. Most existing weakly …

被引用次数：797 相关文章所有 16 个版本

高级搜索

QQ 群