End-to-end temporal action detection with transformer

X Liu, Q Wang, Y Hu, X Tang, S Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …

Proposal-based multiple instance learning for weakly-supervised temporal action localization

H Ren, W Yang, T Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization aims to localize and recognize actions in
untrimmed videos with only video-level category labels during training. Without instance …

Foreground activation maps for weakly supervised object localization

M Meng, T Zhang, Q Tian… - Proceedings of the …, 2021 - openaccess.thecvf.com
Weakly supervised object localization (WSOL) aims to localize objects with only image-level
labels, which has better scalability and practicability than fully supervised methods in the …

Task-aware part mining network for few-shot learning

J Wu, T Zhang, Y Zhang, F Wu - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract Few-Shot Learning (FSL) aims at classifying samples into new unseen classes with
only a handful of labeled samples available. However, most of the existing methods are …

Spatial-temporal based multihead self-attention for remote sensing image change detection

Y Zhou, F Wang, J Zhao, R Yao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The neural network-based remote sensing image change detection method faces a large
amount of imaging interference and severe class imbalance problems under high-resolution …

Learning Models in Crowd Analysis: A Review

S Goel, D Koundal, R Nijhawan - Archives of Computational Methods in …, 2024 - Springer
Crowd detection and counting are important tasks in several applications of crowd analysis
including traffic management, public safety and event planning. Automatic crowd counting …

Temporal action localization in the deep learning era: A survey

B Wang, Y Zhao, L Yang, T Long… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The temporal action localization research aims to discover action instances from untrimmed
videos, representing a fundamental step in the field of intelligent video understanding. With …

Compact representation and reliable classification learning for point-level weakly-supervised action localization

J Fu, J Gao, C Xu - IEEE Transactions on Image Processing, 2022 - ieeexplore.ieee.org
Point-level weakly-supervised temporal action localization (P-WSTAL) aims to localize
temporal extents of action instances and identify the corresponding categories with only a …

A novel deep learning framework for automatic recognition of thyroid gland and tissues of neck in ultrasound image

L Ma, G Tan, H Luo, Q Liao, S Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recognition of thyroid glands and tissues of the neck is vital for screening related diseases
in ultrasound videos. This task is subjective, challenging, and dependent on the experience …

Imposing semantic consistency of local descriptors for few-shot learning

J Cheng, F Hao, L Liu, D Tao - IEEE Transactions on Image …, 2022 - ieeexplore.ieee.org
Few-shot learning suffers from the scarcity of labeled training data. Regarding local
descriptors of an image as representations for the image could greatly augment existing …