Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Deep learning-based action detection in untrimmed videos: A survey

E Vahdani, Y Tian - IEEE Transactions on Pattern Analysis and …, 2022 - ieeexplore.ieee.org
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …

Asformer: Transformer for action segmentation

F Yi, H Wen, T Jiang - arXiv preprint arXiv:2110.08568, 2021 - arxiv.org
Algorithms for the action segmentation task typically use temporal models to predict what
action is occurring at each frame for a minute-long daily activity. Recent studies have shown …

Ms-tcn: Multi-stage temporal convolutional network for action segmentation

YA Farha, J Gall - Proceedings of the IEEE/CVF conference …, 2019 - openaccess.thecvf.com
Temporally locating and classifying action segments in long untrimmed videos is of
particular interest to many applications like surveillance and robotics. While traditional …

Ms-tcn++: Multi-stage temporal convolutional network for action segmentation

S Li, YA Farha, Y Liu, MM Cheng… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
With the success of deep learning in classifying short trimmed videos, more attention has
been focused on temporally segmenting and classifying activities in long untrimmed videos …

Towards automatic learning of procedures from web instructional videos

L Zhou, C Xu, J Corso - Proceedings of the AAAI Conference on …, 2018 - ojs.aaai.org
The potential for agents, whether embodied or software, to learn by observing other agents
performing procedures involving objects and actions is rich. Current research on automatic …

W-talc: Weakly-supervised temporal activity localization and classification

S Paul, S Roy… - Proceedings of the …, 2018 - openaccess.thecvf.com
Most activity localization methods in the literature suffer from the burden of frame-wise
annotation requirement. Learning from weak labels may be a potential solution towards …

Untrimmednets for weakly supervised action recognition and detection

L Wang, Y Xiong, D Lin… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Current action recognition methods heavily rely on trimmed videos for model training.
However, it is expensive and time-consuming to acquire a large-scale trimmed video …

Alleviating over-segmentation errors by detecting action boundaries

Y Ishikawa, S Kasai, Y Aoki… - Proceedings of the …, 2021 - openaccess.thecvf.com
We propose an effective framework for the temporal action segmentation task, namely an
Action Segment Refinement Framework (ASRF). Our model architecture consists of a long …

Cross-task weakly supervised learning from instructional videos

D Zhukov, JB Alayrac, RG Cinbis… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this paper we investigate learning visual models for the steps of ordinary tasks using weak
supervision via instructional narrations and an ordered list of steps instead of strong …