A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

Deep learning technique for human parsing: A survey and outlook

L Yang, W Jia, S Li, Q Song - International Journal of Computer Vision, 2024 - Springer
Human parsing aims to partition humans in image or video into multiple pixel-level semantic
parts. In the last decade, it has gained significantly increased interest in the computer vision …

Boosting video object segmentation via space-time correspondence learning

Y Zhang, L Li, W Wang, R Xie… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current top-leading solutions for video object segmentation (VOS) typically follow a
matching-based regime: for each query frame, the segmentation mask is inferred according …

Joint inductive and transductive learning for video object segmentation

Y Mao, N Wang, W Zhou, H Li - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Semi-supervised video object segmentation is a task of segmenting the target object in a
video sequence given only a mask annotation in the first frame. The limited information …

Cmd: Self-supervised 3d action representation learning with cross-modal mutual distillation

Y Mao, W Zhou, Z Lu, J Deng, H Li - European Conference on Computer …, 2022 - Springer
In 3D action recognition, there exists rich complementary information between skeleton
modalities. Nevertheless, how to model and utilize this information remains a challenging …

Reformulating graph kernels for self-supervised space-time correspondence learning

Z Qin, X Lu, D Liu, X Nie, Y Yin, J Shen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Self-supervised space-time correspondence learning utilizing unlabeled videos holds great
potential in computer vision. Most existing methods rely on contrastive learning with mining …

Locality-aware inter-and intra-video reconstruction for self-supervised correspondence learning

L Li, T Zhou, W Wang, L Yang, J Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Our target is to learn visual correspondence from unlabeled videos. We develop LIIR, a
locality-aware inter-and intra-video reconstruction framework that fills in three missing …

Unified mask embedding and correspondence learning for self-supervised video segmentation

L Li, W Wang, T Zhou, J Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The objective of this paper is self-supervised learning of video object segmentation. We
develop a unified framework which simultaneously models cross-frame dense …

Exposing the self-supervised space-time correspondence learning via graph kernels

Z Qin, X Lu, X Nie, Y Yin, J Shen - … of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Self-supervised space-time correspondence learning is emerging as a promising way of
leveraging unlabeled video. Currently, most methods adapt contrastive learning with mining …

Dense unsupervised learning for video segmentation

N Araslanov, S Schaub-Meyer… - Advances in neural …, 2021 - proceedings.neurips.cc
We present a novel approach to unsupervised learning for video object segmentation (VOS).
Unlike previous work, our formulation allows to learn dense feature representations directly …