Learning vision from models rivals learning vision from data

Y Tian, L Fan, K Chen, D Katabi… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce SynCLR a novel approach for learning visual representations exclusively from
synthetic images without any real data. We synthesize a large dataset of image captions …

Expanding small-scale datasets with guided imagination

Y Zhang, D Zhou, B Hooi, K Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc
The power of DNNs relies heavily on the quantity and quality of training data. However,
collecting and annotating data on a large scale is often expensive and time-consuming. To …

A survey of efficient fine-tuning methods for Vision-Language Models—Prompt and Adapter

J Xing, J Liu, J Wang, L Sun, X Chen, X Gu… - Computers & Graphics, 2024 - Elsevier
Abstract Vision Language Model (VLM) is a popular research field located at the fusion of
computer vision and natural language processing (NLP). With the emergence of transformer …

Temporal knowledge sharing enable spiking neural network learning from past and future

Y Dong, D Zhao, Y Zeng - IEEE Transactions on Artificial …, 2024 - ieeexplore.ieee.org
Spiking Neural Networks (SNNs) have attracted significant attention from researchers across
various domains due to their brain-inspired information processing mechanism. However …

UCG: A Universal Cross-Domain Generator for Transferable Adversarial Examples

Z Li, W Wang, J Li, K Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Generating transferable adversarial examples is a challenging issue in adversarial attacks.
Existing works on transferable adversarial examples generation mainly focus on models …

FACL-Attack: Frequency-Aware Contrastive Learning for Transferable Adversarial Attacks

H Yang, J Jeong, KJ Yoon - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Deep neural networks are known to be vulnerable to security risks due to the inherent
transferable nature of adversarial examples. Despite the success of recent generative model …

MVEB: Self-Supervised Learning With Multi-View Entropy Bottleneck

L Wen, X Wang, J Liu, Z Xu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Self-supervised learning aims to learn representation that can be effectively generalized to
downstream tasks. Many self-supervised approaches regard two views of an image as both …

TransformMix: Learning Transformation and Mixing Strategies from Data

TH Cheung, DY Yeung - arXiv preprint arXiv:2403.12429, 2024 - arxiv.org
Data augmentation improves the generalization power of deep learning models by
synthesizing more training samples. Sample-mixing is a popular data augmentation …

Efficient Image Retrieval Using Hierarchical K-Means Clustering

D Park, Y Hwang - Sensors, 2024 - mdpi.com
The objective of content-based image retrieval (CBIR) is to locate samples from a database
that are akin to a query, relying on the content embedded within the images. A contemporary …

On Pretraining Data Diversity for Self-Supervised Learning

HAAK Hammoud, T Das, F Pizzati, P Torr, A Bibi… - arXiv preprint arXiv …, 2024 - arxiv.org
We explore the impact of training with more diverse datasets, characterized by the number of
unique samples, on the performance of self-supervised learning (SSL) under a fixed …