Causal learning and explanation of deep neural networks via autoencoded activations

M Harradon, J Druce, B Ruttenberg - arXiv preprint arXiv:1802.00541, 2018 - arxiv.org
Deep neural networks are complex and opaque. As they enter application in a variety of
important and safety critical domains, users seek methods to explain their output predictions …

Res-tuning: A flexible and efficient tuning paradigm via unbinding tuner from backbone

Z Jiang, C Mao, Z Huang, A Ma, Y Lv… - Advances in …, 2024 - proceedings.neurips.cc
Parameter-efficient tuning has become a trend in transferring large-scale foundation models
to downstream applications. Existing methods typically embed some light-weight tuners into …

Lion: Implicit vision prompt tuning

H Wang, J Chang, Y Zhai, X Luo, J Sun, Z Lin… - Proceedings of the …, 2024 - ojs.aaai.org
Despite recent promising performances across a range of vision tasks, vision Transformers
still have an issue of high computational costs. Recently, vision prompt learning has …

A hierarchical grocery store image dataset with visual and semantic labels

M Klasson, C Zhang, H Kjellström - 2019 IEEE winter …, 2019 - ieeexplore.ieee.org
Image classification models built into visual support systems and other assistive devices
need to provide accurate predictions about their environment. We focus on an application of …

Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents

E Weber, DP Papadopoulos… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Natural disasters, such as floods, tornadoes, or wildfires, are increasingly pervasive as the
Earth undergoes global warming. It is difficult to predict when and where an incident will …

Prototyping a social media flooding photo screening system based on deep learning

H Ning, Z Li, ME Hodgson, C Wang - ISPRS international journal of geo …, 2020 - mdpi.com
This article aims to implement a prototype screening system to identify flooding-related
photos from social media. These photos, associated with their geographic locations, can …

Exploring fine-grained audiovisual categorization with the ssw60 dataset

G Van Horn, R Qian, K Wilber, H Adam… - … on Computer Vision, 2022 - Springer
We present a new benchmark dataset, Sapsucker Woods 60 (SSW60), for advancing
research on audiovisual fine-grained categorization. While our community has made great …

Deep listwise triplet hashing for fine-grained image retrieval

Y Liang, Y Pan, H Lai, W Liu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Hashing is a practical approach for the approximate nearest neighbor search. Deep hashing
methods, which train deep networks to generate compact and similarity-preserving binary …

Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?

C Han, Q Wang, Y Cui, W Wang, L Huang, S Qi… - arXiv preprint arXiv …, 2024 - arxiv.org
As the scale of vision models continues to grow, the emergence of Visual Prompt Tuning
(VPT) as a parameter-efficient transfer learning technique has gained attention due to its …

Dynamic tuning towards parameter and inference efficiency for vit adaptation

W Zhao, J Tang, Y Han, Y Song, K Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing parameter-efficient fine-tuning (PEFT) methods have achieved significant success
on vision transformers (ViTs) adaptation by improving parameter efficiency. However, the …