Large inversion effects are not specific to faces and do not vary with object expertise

C Rezlescu, A Chapman, T Susilo… - Scientific …, 2016 - discovery.ucl.ac.uk
Visual object recognition is impaired when stimuli are shown upside-down. This
phenomenon is known as the inversion effect, and a substantial body of evidence suggests …

Dynamically reducing pressure on the physical register file through simple register sharing

L Tran, N Nelson, F Ngai, S Dropsho… - … Analysis of Systems …, 2004 - ieeexplore.ieee.org
Using register renaming and physical registers, modern microprocessors eliminate false
data dependences from reuse of the instruction set defined registers (logical registers). High …

TransformMix: Learning Transformation and Mixing Strategies from Data

TH Cheung, DY Yeung - arXiv preprint arXiv:2403.12429, 2024 - arxiv.org
Data augmentation improves the generalization power of deep learning models by
synthesizing more training samples. Sample-mixing is a popular data augmentation …

Efficient Image Retrieval Using Hierarchical K-Means Clustering

D Park, Y Hwang - Sensors, 2024 - mdpi.com
The objective of content-based image retrieval (CBIR) is to locate samples from a database
that are akin to a query, relying on the content embedded within the images. A contemporary …

On Pretraining Data Diversity for Self-Supervised Learning

HAAK Hammoud, T Das, F Pizzati, P Torr, A Bibi… - arXiv preprint arXiv …, 2024 - arxiv.org
We explore the impact of training with more diverse datasets, characterized by the number of
unique samples, on the performance of self-supervised learning (SSL) under a fixed …

Discrepant Semantic Diffusion Boosts Transfer Learning Robustness

Y Gao, S Bai, X Zhao, R Gong, Y Wu, Y Ma - Electronics, 2023 - mdpi.com
Transfer learning could improve the robustness and generalization of the model, reducing
potential privacy and security risks. It operates by fine-tuning a pre-trained model on …

Fine-grained image analysis via progressive feature learning

Y Yan, B Ni, H Wei, X Yang - Neurocomputing, 2020 - Elsevier
Due to large intra-class variation and inter-class ambiguity, fine-grained object recognition
has been a challenging task for decades. A good approach should be able to:(1) discover …

Neural Tree Decoder for Interpretation of Vision Transformers

S Kim, BC Ko - IEEE Transactions on Artificial Intelligence, 2023 - ieeexplore.ieee.org
In this study, we propose a novel vision transformer neural tree decoder (ViT-NeT) that is
interpretable and highly accurate in terms of fine-grained visual categorization (FGVC). A …

Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling

C Rodriguez-Opazo, E Abbasnejad, D Teney… - arXiv preprint arXiv …, 2024 - arxiv.org
Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for
image representation learning. Various architectures, from vision transformers (ViTs) to …

Facial Image Feature Analysis and its Specialization for Fr\'echet Distance and Neighborhoods

D Cetin, B Schesch, P Stamenkovic, NB Huber… - arXiv preprint arXiv …, 2024 - arxiv.org
Assessing distances between images and image datasets is a fundamental task in vision-
based research. It is a challenging open problem in the literature and despite the criticism it …