Training vision transformers for image retrieval

A El-Nouby, N Neverova, I Laptev, H Jégou - arXiv preprint arXiv …, 2021 - arxiv.org
… and, more recently, for image classification. We here extend this work and propose a
transformer-based approach for image retrieval: we adopt vision transformers for generating image

Boosting vision transformers for image retrieval

CH Song, J Yoon, S Choi… - … of Computer Vision, 2023 - openaccess.thecvf.com
Image retrieval using vision transformers In Table 2, we compare with the few previous
approaches using vision transformers as backbones for image retrieval… same transformer encoder …

Investigating the vision transformer model for image retrieval tasks

S Gkelios, Y Boutalis… - 2021 17th International …, 2021 - ieeexplore.ieee.org
… upon the Vision Transformer architecture to shape a global descriptor for image retrieval
tasks… Following the procedure that the vast majority of the deeplearning-based image retrieval

HashFormer: Vision transformer based deep hashing for image retrieval

T Li, Z Zhang, L Pei, Y Gan - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org
… of vision transformers, we propose a pure transformer-based framework, called as HashFormer,
to tackle the deep hashing task. Specifically, we utilize vision transformer (… task, ie, image

Vision transformer hashing for image retrieval

SR Dubey, SK Singh, WT Chu - 2022 IEEE international …, 2022 - ieeexplore.ieee.org
… Recently, Transformer has emerged as a new architecture in … Transformer is also extended
to Vision Transformer (ViT) for … a Vision Transformer Hashing (VTS) for image retrieval. We …

Image retrieval using convolutional autoencoder, infogan, and vision transformer unsupervised models

ES Sabry, SS Elagooz, FE Abd El-Samie… - IEEE …, 2023 - ieeexplore.ieee.org
… from a query image and their correspondence in … retrieval problems is Facial Sketched-Real
Image Retrieval (FSRIR), which is content similarity matching based. These facial retrieving

Multi-modal transformer with global-local alignment for composed query image retrieval

Y Xu, Y Bin, J Wei, Y Yang, G Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
image retrieval, which aims at retrieving the target image similar to the composed query, ie,
a reference image … We leverage the vision Transformer and language Transformer to encode …

Instance-level image retrieval using reranking transformers

F Tan, J Yuan, V Ordonez - … conference on computer vision, 2021 - openaccess.thecvf.com
… We introduce Reranking Transformers (RRTs) for instance image retrieval. We show that
RRTs outperform prior reranking approaches across a variety of settings. Compared to …

Transhash: Transformer-based hamming hashing for efficient image retrieval

Y Chen, S Zhang, F Liu, Z Chang, M Ye… - … on multimedia retrieval, 2022 - dl.acm.org
… advancements of vision transformers, we present Transhash, a pure transformer-based …
major modules: (1) Based on Vision Transformer (ViT), we design a siamese Multi-Granular …

[PDF][PDF] SwinFGHash: Fine-grained Image Retrieval via Transformer-based Hashing Network.

D Lu, J Wang, Z Zeng, B Chen, S Wu… - BMVC, 2021 - bmvc2021-virtualconference.com
… step to exploit the vision transformer-based hashing network for fine-grained image retrieval.
We propose the SwinFGHash, which takes advantage of transformer-based architecture to …