Fashion meets computer vision: A survey

WH Cheng, S Song, CY Chen, SC Hidayati… - ACM Computing Surveys …, 2021 - dl.acm.org
Fashion is the way we present ourselves to the world and has become one of the world's
largest industries. Fashion, mainly conveyed by vision, has thus attracted much attention …

Effective conditioned and composed image retrieval combining clip-based features

A Baldrati, M Bertini, T Uricchio… - Proceedings of the …, 2022 - openaccess.thecvf.com
Conditioned and composed image retrieval extend CBIR systems by combining a query
image with an additional text that expresses the intent of the user, describing additional …

Similarity reasoning and filtration for image-text matching

H Diao, Y Zhang, L Ma, H Lu - Proceedings of the AAAI conference on …, 2021 - ojs.aaai.org
Image-text matching plays a critical role in bridging the vision and language, and great
progress has been made by exploiting the global alignment between image and sentence …

Learning with noisy correspondence for cross-modal matching

Z Huang, G Niu, X Liu, W Ding… - Advances in Neural …, 2021 - proceedings.neurips.cc
Cross-modal matching, which aims to establish the correspondence between two different
modalities, is fundamental to a variety of tasks such as cross-modal retrieval and vision-and …

Artificial Intelligence in Business-to-Customer Fashion Retail: A Literature Review

A Goti, L Querejeta-Lomas, A Almeida, JG de la Puerta… - Mathematics, 2023 - mdpi.com
Many industries, including healthcare, banking, the auto industry, education, and retail, have
already undergone significant changes because of artificial intelligence (AI). Business-to …

Product1m: Towards weakly supervised instance-level product retrieval via cross-modal pretraining

X Zhan, Y Wu, X Dong, Y Wei, M Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Nowadays, customer's demands for E-commerce are more diversified, which introduces
more complications to the product retrieval industry. Previous methods are either subject to …

Unifying knowledge iterative dissemination and relational reconstruction network for image–text matching

X Xie, Z Li, Z Tang, D Yao, H Ma - Information Processing & Management, 2023 - Elsevier
Image–text matching is a crucial branch in multimedia retrieval which relies on learning inter-
modal correspondences. Most existing methods focus on global or local correspondence …

Plug-and-play regulators for image-text matching

H Diao, Y Zhang, W Liu, X Ruan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Exploiting fine-grained correspondence and visual-semantic alignments has shown great
potential in image-text matching. Generally, recent approaches first employ a cross-modal …

Texture and shape biased two-stream networks for clothing classification and attribute recognition

Y Zhang, P Zhang, C Yuan… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Clothes category classification and attribute recognition have achieved distinguished
success with the development of deep learning. People have found that landmark detection …

Fine-grained fashion similarity prediction by attribute-specific embedding learning

J Dong, Z Ma, X Mao, X Yang, Y He… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one
should pay more attention to the similarity in terms of a specific design/attribute between …