Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Progress-aware online action segmentation for egocentric procedural task videos

Y Shen, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We address the problem of online action segmentation for egocentric procedural task
videos. While previous studies have mostly focused on offline action segmentation where …

Fast and unsupervised action boundary detection for action segmentation

Z Du, X Wang, G Zhou, Q Wang - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
To deal with the great number of untrimmed videos produced every day, we propose an
efficient unsupervised action segmentation method by detecting boundaries, named action …

Bottom-up skill discovery from unsegmented demonstrations for long-horizon robot manipulation

Y Zhu, P Stone, Y Zhu - IEEE Robotics and Automation Letters, 2022 - ieeexplore.ieee.org
We tackle real-world long-horizon robot manipulation tasks through skill discovery. We
present a bottom-up approach to learning a library of reusable skills from unsegmented …

Weakly-supervised online action segmentation in multi-view instructional videos

R Ghoddoosian, I Dwivedi, N Agarwal… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper addresses a new problem of weakly-supervised online action segmentation in
instructional videos. We present a framework to segment streaming videos online at test time …

3dfcnn: Real-time action recognition using 3d deep neural networks with raw depth information

A Sanchez-Caballero, S de López-Diz… - Multimedia Tools and …, 2022 - Springer
This work describes an end-to-end approach for real-time human action recognition from
raw depth image-sequences. The proposal is based on a 3D fully convolutional neural …

Leveraging triplet loss for unsupervised action segmentation

E Bueno-Benito, BT Vecino… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose a novel fully unsupervised framework that learns action
representations suitable for the action segmentation task from the single input video itself …

C2F-TCN: A framework for semi-and fully-supervised temporal action segmentation

D Singhania, R Rahaman, A Yao - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Temporal action segmentation tags action labels for every frame in an input untrimmed
video containing multiple actions in a sequence. For the task of temporal action …

Mhms: Multimodal hierarchical multimedia summarization

J Qiu, J Zhu, M Xu, F Dernoncourt, T Bui… - arXiv preprint arXiv …, 2022 - arxiv.org
Multimedia summarization with multimodal output can play an essential role in real-world
applications, ie, automatically generating cover images and titles for news articles or …

Sscap: Self-supervised co-occurrence action parsing for unsupervised temporal action segmentation

Z Wang, H Chen, X Li, C Liu, Y Xiong… - Proceedings of the …, 2022 - openaccess.thecvf.com
Temporal action segmentation is a task to classify each frame in the video with an action
label. However, it is quite expensive to annotate every frame in a large corpus of videos to …