Multimodal co-learning: Challenges, applications with datasets, recent advances and future directions

A Rahate, R Walambe, S Ramanna, K Kotecha - Information Fusion, 2022 - Elsevier
Multimodal deep learning systems that employ multiple modalities like text, image, audio,
video, etc., are showing better performance than individual modalities (ie, unimodal) …

A systematic review of robustness in deep learning for computer vision: Mind the gap?

N Drenkow, N Sani, I Shpitser, M Unberath - arXiv preprint arXiv …, 2021 - arxiv.org
Deep neural networks for computer vision are deployed in increasingly safety-critical and
socially-impactful applications, motivating the need to close the gap in model performance …

Efficient test-time model adaptation without forgetting

S Niu, J Wu, Y Zhang, Y Chen… - International …, 2022 - proceedings.mlr.press
Test-time adaptation provides an effective means of tackling the potential distribution shift
between model training and inference, by dynamically updating the model at test time. This …

Tent: Fully test-time adaptation by entropy minimization

D Wang, E Shelhamer, S Liu, B Olshausen… - arXiv preprint arXiv …, 2020 - arxiv.org
A model must adapt itself to generalize to new and different data during testing. In this
setting of fully test-time adaptation the model has only the test data and its own parameters …

MedViT: a robust vision transformer for generalized medical image classification

ON Manzari, H Ahmadabadi, H Kashiani… - Computers in Biology …, 2023 - Elsevier
Abstract Convolutional Neural Networks (CNNs) have advanced existing medical systems
for automatic disease diagnosis. However, there are still concerns about the reliability of …

Back to the source: Diffusion-driven adaptation to test-time corruption

J Gao, J Zhang, X Liu, T Darrell… - Proceedings of the …, 2023 - openaccess.thecvf.com
Test-time adaptation harnesses test inputs to improve the accuracy of a model trained on
source data when tested on shifted target data. Most methods update the source model by …

3d common corruptions and data augmentation

OF Kar, T Yeo, A Atanov… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We introduce a set of image transformations that can be used as corruptions to evaluate the
robustness of models as well as data augmentation mechanisms for training neural …

Towards robust vision transformer

X Mao, G Qi, Y Chen, X Li, R Duan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract Recent advances on Vision Transformer (ViT) and its improved variants have
shown that self-attention-based networks surpass traditional Convolutional Neural Networks …

Augmax: Adversarial composition of random augmentations for robust training

H Wang, C Xiao, J Kossaifi, Z Yu… - Advances in neural …, 2021 - proceedings.neurips.cc
Data augmentation is a simple yet effective way to improve the robustness of deep neural
networks (DNNs). Diversity and hardness are two complementary dimensions of data …

Assaying out-of-distribution generalization in transfer learning

F Wenzel, A Dittadi, P Gehler… - Advances in …, 2022 - proceedings.neurips.cc
Since out-of-distribution generalization is a generally ill-posed problem, various proxy
targets (eg, calibration, adversarial robustness, algorithmic corruptions, invariance across …