A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

Multi-modal 3d object detection in autonomous driving: A survey and taxonomy

L Wang, X Zhang, Z Song, J Bi, G Zhang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Autonomous vehicles require constant environmental perception to obtain the distribution of
obstacles to achieve safe driving. Specifically, 3D object detection is a vital functional …

Spot-the-difference self-supervised pre-training for anomaly detection and segmentation

Y Zou, J Jeong, L Pemula, D Zhang… - European Conference on …, 2022 - Springer
Visual anomaly detection is commonly used in industrial quality inspection. In this paper, we
present a new dataset as well as a new self-supervised learning method for ImageNet pre …

Federated learning from pre-trained models: A contrastive learning approach

Y Tan, G Long, J Ma, L Liu, T Zhou… - Advances in neural …, 2022 - proceedings.neurips.cc
Federated Learning (FL) is a machine learning paradigm that allows decentralized clients to
learn collaboratively without sharing their private data. However, excessive computation and …

Sparse r-cnn: End-to-end object detection with learnable proposals

P Sun, R Zhang, Y Jiang, T Kong… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present Sparse R-CNN, a purely sparse method for object detection in images.
Existing works on object detection heavily rely on dense object candidates, such as k anchor …

Multi-granularity cross-modal alignment for generalized medical visual representation learning

F Wang, Y Zhou, S Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Learning medical visual representations directly from paired radiology reports has become
an emerging topic in representation learning. However, existing medical image-text joint …

HoLoCo: Holistic and local contrastive learning network for multi-exposure image fusion

J Liu, G Wu, J Luan, Z Jiang, R Liu, X Fan - Information Fusion, 2023 - Elsevier
Multi-exposure image fusion (MEF) targets to integrate multiple shots with different
exposures and generates a single higher dynamic image than each. Existing deep learning …

C2am: Contrastive learning of class-agnostic activation map for weakly supervised object localization and semantic segmentation

J Xie, J Xiang, J Chen, X Hou… - Proceedings of the …, 2022 - openaccess.thecvf.com
While class activation map (CAM) generated by image classification network has been
widely used for weakly supervised object localization (WSOL) and semantic segmentation …

[HTML][HTML] 基于深度学习的视觉目标检测技术综述

曹家乐, 李亚利, 孙汉卿, 谢今, 黄凯奇, 庞彦伟 - 2022 - cjig.cn
摘要视觉目标检测旨在定位和识别图像中存在的物体, 属于计算机视觉领域的经典任务之一,
也是许多计算机视觉任务的前提与基础, 在自动驾驶, 视频监控等领域具有重要的应用价值 …

Efficient self-supervised vision transformers for representation learning

C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper investigates two techniques for developing efficient self-supervised vision
transformers (EsViT) for visual representation learning. First, we show through a …