Cross-modal implicit relation reasoning and aligning for text-to-image person retrieval

D Jiang, M Ye - Proceedings of the IEEE/CVF Conference …, 2023 - openaccess.thecvf.com
Text-to-image person retrieval aims to identify the target person based on a given textual
description query. The primary challenge is to learn the mapping of visual and textual …

Clip-driven fine-grained text-image person re-identification

S Yan, N Dong, L Zhang, J Tang - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org
Text-Image Person Re-identification (TIReID) aims to retrieve the image corresponding to
the given text query from a pool of candidate images. Existing methods employ prior …

Transformer for object re-identification: A survey

M Ye, S Chen, C Li, WS Zheng, D Crandall… - International Journal of …, 2024 - Springer
Abstract Object Re-identification (Re-ID) aims to identify specific objects across different
times and scenes, which is a widely researched task in computer vision. For a prolonged …

See finer, see more: Implicit modality alignment for text-based person retrieval

X Shu, W Wen, H Wu, K Chen, Y Song, R Qiao… - … on Computer Vision, 2022 - Springer
Text-based person retrieval aims to find the query person based on a textual description.
The key is to learn a common latent space mapping between visual-textual modalities. To …

Learning granularity-unified representations for text-to-image person re-identification

Z Shao, X Zhang, M Fang, Z Lin, J Wang… - Proceedings of the 30th …, 2022 - dl.acm.org
Text-to-image person re-identification (ReID) aims to search for pedestrian images of an
interested identity via textual descriptions. It is challenging due to both rich intra-modal …

Pedestrian-specific bipartite-aware similarity learning for text-based person retrieval

F Shen, X Shu, X Du, J Tang - Proceedings of the 31st ACM International …, 2023 - dl.acm.org
Text-based person retrieval is a challenging task that aims to search pedestrian images with
the same identity according to language descriptions. Current methods usually …

Towards unified text-based person retrieval: A large-scale multi-attribute and language search benchmark

S Yang, Y Zhou, Z Zheng, Y Wang, L Zhu… - Proceedings of the 31st …, 2023 - dl.acm.org
In this paper, we introduce a large Multi-Attribute and Language Search dataset for text-
based person retrieval, called MALS, and explore the feasibility of performing pre-training on …

Fashionvil: Fashion-focused vision-and-language representation learning

X Han, L Yu, X Zhu, L Zhang, YZ Song… - European conference on …, 2022 - Springer
Abstract Large-scale Vision-and-Language (V+ L) pre-training for representation learning
has proven to be effective in boosting various downstream V+ L tasks. However, when it …

Noisy-correspondence learning for text-to-image person re-identification

Y Qin, Y Chen, D Peng, X Peng… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal
community which aims to retrieve the target person based on a textual query. Although …

A simple and robust correlation filtering method for text-based person search

W Suo, M Sun, K Niu, Y Gao, P Wang, Y Zhang… - European conference on …, 2022 - Springer
Text-based person search aims to associate pedestrian images with natural language
descriptions. In this task, extracting differentiated representations and aligning them among …