Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

Learning multiple adverse weather removal via two-stage knowledge learning and multi-contrastive regularization: Toward a unified model

WT Chen, ZK Huang, CC Tsai… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, an ill-posed problem of multiple adverse weather removal is investigated. Our
goal is to train a model with a'unified'architecture and only one set of pretrained weights that …

A survey on semi-, self-and unsupervised learning for image classification

L Schmarje, M Santarossa, SM Schröder… - IEEE Access, 2021 - ieeexplore.ieee.org
While deep learning strategies achieve outstanding results in computer vision tasks, one
issue remains: The current strategies rely heavily on a huge amount of labeled data. In many …

Deep learning for free-hand sketch: A survey

P Xu, TM Hospedales, Q Yin, YZ Song… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Free-hand sketches are highly illustrative, and have been widely used by humans to depict
objects or stories from ancient times to the present. The recent prevalence of touchscreen …

Multigraph transformer for free-hand sketch recognition

P Xu, CK Joshi, X Bresson - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org
Learning meaningful representations of free-hand sketches remains a challenging task
given the signal sparsity and the high-level abstraction of sketches. Existing techniques …

An improved inter-intra contrastive learning framework on self-supervised video representation

L Tao, X Wang, T Yamasaki - … on Circuits and Systems for Video …, 2022 - ieeexplore.ieee.org
In this paper, we propose a self-supervised contrastive learning method to learn video
feature representations. In traditional self-supervised contrastive learning methods …

Embedding global contrastive and local location in self-supervised learning

W Zhao, C Li, W Zhang, L Yang… - … on Circuits and …, 2022 - ieeexplore.ieee.org
Self-supervised representation learning (SSL) typically suffers from inadequate data
utilization and feature-specificity due to the suboptimal sampling strategy and the …

Multistage spatio-temporal networks for robust sketch recognition

H Li, X Jiang, B Guan, R Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Sketch recognition relies on two types of information, namely, spatial contexts like the local
structures in images and temporal contexts like the orders of strokes. Existing methods …

Self-supervised representation learning for videos by segmenting via sampling rate order prediction

J Huang, Y Huang, Q Wang, W Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Self-supervised representation learning for videos has been very attractive recently because
these methods exploit the information inherently obtained from the video itself instead of …

Fine-grained instance-level sketch-based video retrieval

P Xu, K Liu, T Xiang, TM Hospedales… - … on Circuits and …, 2020 - ieeexplore.ieee.org
Existing sketch-analysis work studies sketches depicting static objects or scenes. In this
work, we propose a novel cross-modal retrieval problem of fine-grained instance-level …